Method for HLA-DP typing.

A process for determining an individual's HLA-DP genotype from a nucleic acid containing sample obtained
from the individual, which process comprises: (a) amplifying a target region of the nucleic acids in the sample
under conditions suitable for carrying out a polymerase chain reaction, whereby the target region contains a
polymorphic region of an HLA DP gene, using a specific primer; (b) mixing the amplified nucleic acids with a
panel of sequence specific oligonucleotide (SSO) probes, wherein each probe is complementary to a variant
sequence of a variable segment of an HLA DP gene, under conditions wherein SSO probes bind to said
amplified nucleic acids to form stable hybrid duplexes only if they are exactly complementary; and
(c) detecting hybrids formed between the amplified nucleic acids and the SSO probes. The invention also
relates to oligonucleotide primers and to oligonucleotide probes. These HLA-DP typing methods may be
useful in the prevention of graft rejection and graft versus host disease, in determining susceptibility to
autoimmune diseases, in providing evidence concerning the derivation from an individual of forensic
samples, and in paternity testing.

This invention relates to a method and compositions for determining the HLA-DP genotype of an
individual using gene amplification methodology as disclosed and claimed in U.S. Patent Nos. 4,683,195
and 4,683,202 and the dot-blot and allele-specific oligonucleotide probe technology as disclosed and
claimed in U.S. Patent No. 4,683,194. The methods and probes of the invention specifically relate to the
detection of the polymorphic class II HLA-DP genes. The invention relates to the fields of molecular biology,
diagnostic medicine, and forensics.

To aid in understanding the invention described below, reference is made to the introductory part of
International Patent Application, Publication No. WO 89/11547, wherein an introduction to the fields
mentioned above is given and wherein the definition of the terms used is the same as the definition of the
terms used in the present specification.

WO 89/11547 discloses a process for determining an individual's HLA-DP genotype from a nucleic acid
containing sample obtained from the individual comprising: (a) amplifying a target region of said nucleic
acid which contains a polymorphic region of an HLA-DP gene; (b) hybridizing the amplified nucleic acid with
a panel of sequence specific oligonucleotide (SSO) probes specific for variant segments of HLA-DP genes
under conditions that allow each of said SSO probes to form a stable hybrid duplex with the amplified
nucleic acid only if the complementary sequence is contained in the amplified nucleic acid; and (c)
detecting hybrids formed between the amplified nucleic acid and the SSO probes.

WO 89/11547 also describes kits useful for determining the HLA-DP genotype of an individual; these
kits comprise (a) a panel of SSO probes for allelic variant sequences in said target region; and (b) instructions
for determining the genotype by utilizing kit ingredients.

The present invention provides improved processes and reagents for determining the HLA-DP genotype
of an individual from a nucleic acid containing sample obtained from said individual. The invention results
from the discovery and characterization of further sequence polymorphisms in the variable second exon of
the HLA-DPB1 alleles. As a result, further DPB1 (DPbeta) allelic variants have been discovered, viz. the
alleles DPB21, DPB22, DPB23, DPB24, DPB25, DPB26, DPB27, DPB28, DPB29, and DPB30 described
below. Based upon the novel sequence of these DP genes, the sequences of SSO probes are provided for
the detection of the variant genotypes. The variations between the different DP alleles are dispersed.
Therefore, one probe alone is rarely able to identify uniquely a specific DPB1 allele. Rather, the identity of
an allele is inferred from the pattern of binding of a panel of probes, with each individual probe of the panel
specific to different segments of a DPB1 gene.

More specifically, the present invention relates to a process for determining an individual's HLA DP
genotype from a nucleic acid containing sample originating from the individual whose HLA-DP genotype is
to be determined, which method comprises amplifying a target region of the nucleic acids in the sample
under conditions suitable for carrying out a polymerase chain reaction, whereby the target region contains a
polymorphic region (variable segment) of an HLA DP gene using a primer selected from the group
consisting of the primers



AB111 SEQ ID NO: 91 5'GGGATCCGAGAGTGGCGCCTCCGCTCAT,
RS348 SEQ ID NO: 142 5'ACACAGGAAACAGCTATGACCATG, and
RS349 SEQ ID NO: 143 5'CCAGGGTTTTCCCAGTCACGAC;



mixing the amplified nucleic acids with a panel of sequence specific oligonucleotide (SSO) probes, wherein
each probe is complementary to a variant sequence of a variable segment of an HLA DP gene, under
conditions wherein SSO probes bind to said amplified nucleic acids to form stable hybrid duplexes only if
they are exactly complementary; and detecting hybrids formed between the amplified nucleic acids and the
SSO probes. Said probes are preferably selected from the group consisting of:






SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG);
SEQ ID NO: 99 (5'GAATTACCTTTTCCAGGGA);
SEQ ID NO: 100 (5'ATTACGTGTACCAGTTACG);
SEQ ID NO: 101 (5'CGTCCCTGGTACACGTAAT);
SEQ ID NO: 102 (5'CCTGCTGCGGAGTACTG);
SEQ ID NO: 103 (5'CAGTACTCCTCATCAGG);
SEQ ID NO: 104 (5'CAGTACTCCGCCTCAGG);
SEQ ID NO: 105 (5'CCTGATGAGGACTACTG);
SEQ ID NO: 106 (5'GACATCCTGGAGGAGAAGC);
SEQ ID NO: 107 (5'GCTCCTCCTCCAGGATGTC);
SEQ ID NO: 108 (5'GACCTCCTGGAGGAGAAGC);
SEQ ID NO: 109 (5'GCTCCTCCTCCAGGAGGTC);
SEQ ID NO: 110 (5'ATTACGTGCACCAGTTACG);
SEQ ID NO: 111 (5'CGTAACTGGTACACGTAAT);
SEQ ID NO: 112 (5'CTGCAGGGTCATGGGCCCCCG);
SEQ ID NO: 113 (5'CTGCAGGGTCACGGCCTCGTC);
SEQ ID NO: 114 (5'ATTACGTGTACCAGTTA);
SEQ ID NO: 115 (5'CCTGAGGCGGAGTACTG);
SEQ ID NO: 116 (5'GACCTCCTGGAGGAGGAG);
SEQ ID NO: 117 (5'GACCTCCTGGAGGAGAGG);
SEQ ID NO: 118 (5'CTGGTCGGGCCCATGACC);
SEQ ID NO: 119 (5'GAATTACCTTTTCCAGGGAC);
SEQ ID NO: 120 (5'TTACGTGTACCTGGGAC);
SEQ ID NO: 121 (5'ACATCCTGGAGGAGAAGC);
SEQ ID NO: 122 (5'ACATCCTGGAGGAGGAGC);
SEQ ID NO: 123 (5'ACCTCCTGGAGGAGAAGC);
SEQ ID NO: 124 (5'CCTGATGAGGAGTACTG); 
SEQ ID NO: 125 (5'CTGGGCGGGCCCATG); 
SEQ ID NO: 126 (5'CTGGACGAGGCCGTG); 
SEQ ID NO: 127 (5'GACCTCCTGGAGGAGGAGC); 
SEQ ID NO: 128 (5'GACCTCCTGGAGGAGAGGC); 
SEQ ID NO: 129 (5'AGCTGGGCGGGCCCATGAC); 
SEQ ID NO: 130 (5'AGCTGGACGAGGCCGTGAC); 
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC); 
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA); 
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG); 
SEQ ID NO: 135 (5'CTGATGAGGACTACTG); 
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC);
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC);
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC);
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT);
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT);
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA), 


more preferably they are selected from the group of novel probes having the nucleic acid sequences:

SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG); 
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC); 
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA); 
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG); 
SEQ ID NO: 135 (5'CTGATGAGGACTACTG);

SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC); 
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC); 
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC);
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA).
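
Some probes in the first list above address the same variant segment on opposite strands. The following minimal sketch, which is an illustration and not part of the disclosed method, checks that relationship for two listed probes by computing a reverse complement. The helper name `reverse_complement` is an assumption introduced here.

```python
# Illustrative helper (not from the specification): reverse complement of a DNA string.
# "N" is mapped to itself because some listed probes contain an unspecified base N.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Hybridizing regions copied from the probe list above.
seq_id_124 = "CCTGATGAGGAGTACTG"
seq_id_103 = "CAGTACTCCTCATCAGG"

# SEQ ID NO: 103 reads as the opposite-strand counterpart of SEQ ID NO: 124.
print(reverse_complement(seq_id_124) == seq_id_103)
```

A check of this kind can help catch transcription errors when probe sequences are copied between documents.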

In the preferred process in accordance with the present invention the panel of probes comprises probes 
with hybridizing regions: 

SEQ ID NO: 88 (5'TGTCTGCACATCCTGTCCG);
SEQ ID NO: 89 (5'TGTCTGCATACCCTGTCCG);
SEQ ID NO: 90 (5'CGGACAGGATATGCAGACA);
SEQ ID NO: 99 (5'GAATTACCTTTTCCAGGGA);
SEQ ID NO: 101 (5'CGTCCCTGGTACACGTAAT);
SEQ ID NO: 102 (5'CCTGCTGCGGAGTACTG);
SEQ ID NO: 105 (5'CCTGATGAGGACTACTG);
SEQ ID NO: 106 (5'GACATCCTGGAGGAGAAGC);
SEQ ID NO: 107 (5'GCTCCTCCTCCAGGATGTC);
SEQ ID NO: 108 (5'GACCTCCTGGAGGAGAAGC);
SEQ ID NO: 110 (5'ATTACGTGCACCAGTTACG);
SEQ ID NO: 112 (5'CTGCAGGGTCATGGGCCCCCG); 
SEQ ID NO: 114 (5'ATTACGTGTACCAGTTA); 



SEQ ID NO: 115 (5'CCTGAGGCGGAGTACTG);
SEQ ID NO: 116 (5'GACCTCCTGGAGGAGGAG);
SEQ ID NO: 117 (5'GACCTCCTGGAGGAGAGG);
SEQ ID NO: 126 (5'CTGGACGAGGCCGTG); and 
SEQ ID NO: 131 (5'CCAGTACTCCTCATCAGGC), 

most preferably said panel of probes further comprises a probe with hybridizing region SEQ ID NO: 92
(5'CCTGATGAGGTGTACTG). 

The present invention relates also to the novel primers

AB111 SEQ ID NO: 91 5'GGGATCCGAGAGTGGCGCCTCCGCTCAT,
RS348 SEQ ID NO: 142 5'ACACAGGAAACAGCTATGACCATG, and
RS349 SEQ ID NO: 143 5'CCAGGGTTTTCCCAGTCACGAC per se,

and to the novel probes mentioned above per se, especially when used as a diagnostic tool, e.g. for
determining an individual's HLA DP genotype, such as in the case wherein the individual's HLA DP
genotype comprises an allele selected from the group consisting of DPB21, DPB22, DPB23, DPB24,
DPB25, DPB26, DPB27, DPB28, DPB29, and DPB30.

Because the present invention also is compatible with amplified nucleic acids, and because the PCR
technique can amplify extremely small quantities of nucleic acid, samples containing vanishingly small
amounts of nucleic acid can be typed for the presence of particular HLA-DP variants by the method of the
present invention. For instance, even a single hair contains enough DNA for purposes of the present
invention, as evidenced by the work with DQalpha described by Higuchi et al., 1988, Nature 332:543-546.

In general, the nucleic acid in the sample will be DNA, most usually genomic DNA. However, the
present invention can also be practiced with other nucleic acids, such as messenger RNA or cloned DNA,
and the nucleic acid may be either single-stranded or double-stranded in the sample and still be suitable for
purposes of the present invention. Those of skill in the art recognize that whatever the nature of the nucleic
acid, the nucleic acid can be typed by the present method merely by taking appropriate steps at the
relevant stage of the process. When PCR is used to amplify the nucleic acid in the sample, then the sample
will usually comprise double-stranded DNA when typed with the novel probes of the invention.

As noted above, the HLA-DP typing method and probes of the invention are used in conjunction with
PCR-amplified target DNA. Those practicing the present invention should note, however, that amplification
of HLA-DP target sequences in a sample may be accomplished by any known method which provides
sufficient amplification so that the target sequence may be detected by nucleic acid hybridization to an SSO
probe. Although the PCR process is well known in the art (see U.S. Patent Nos. 4,683,195 and 4,683,202)
and although a variety of commercial vendors, such as Perkin Elmer (Norwalk, CT), sell PCR reagents and
publish PCR protocols, some general PCR information is provided below for purposes of clarity and full
understanding of the invention to those unfamiliar with the PCR process.

To amplify a target nucleic acid sequence in a sample by PCR, the sequence must be accessible to the
components of the amplification system. In general, this accessibility is ensured by isolating the nucleic
acids from the sample. A variety of techniques for extracting nucleic acids from biological samples are
known in the art. For example, see those described in Maniatis et al., Molecular Cloning: A Laboratory
Manual, (New York, Cold Spring Harbor Laboratory, 1982). Alternatively, if the sample is fairly readily
disruptable, the nucleic acid need not be purified prior to amplification by the PCR technique, i.e., if the
sample is comprised of cells, particularly peripheral blood lymphocytes or amniocytes, lysis and dispersion
of the intracellular components may be accomplished merely by suspending the cells in hypotonic buffer.

Because the nucleic acid in the sample is first denatured (assuming the sample nucleic acid is
double-stranded) to begin the PCR process, and because simply heating some samples results in the disruption of
cells, isolation of nucleic acid from the sample can sometimes be accomplished in conjunction with strand
separation. Strand separation can be accomplished by any suitable denaturing method including physical,
chemical, or enzymatic means. Typical heat denaturation involves temperatures ranging from about 80°C to
105°C for times ranging from about 1 to 10 minutes. Strand separation may also be induced by a helicase,
an enzyme capable of exhibiting helicase activity. For example, the enzyme RecA has helicase activity in
the presence of ATP. The reaction conditions suitable for strand separation by helicases are known in the
art (see Kuhn Hoffman-Berling, 1978, CSH-Quantitative Biology 43:63, and Radding, 1982, Ann. Rev.
Genetics 16:405-436).

As noted above, strand separation may be accomplished in conjunction with the isolation of the sample
nucleic acid or as a separate step. In this embodiment of the PCR process, the reaction is catalyzed by a
heat-stable polymerase and carried out at an elevated temperature. The temperature is one at which the
enzyme is thermostable, and at which the nucleic acids are in an equilibrium of single and double strands,
so that sufficient primer will anneal to template strands to allow a reasonable rate of polymerization. In the
preferred embodiment of the PCR process, however, strand separation is achieved by heating the reaction
to a sufficiently high temperature for an effective time to cause the denaturation of the duplex, but not to
cause an irreversible denaturation of the polymerase (see European Patent Application, Publication No.
258,017, incorporated herein by reference).

Once the strands are separated, the next step in PCR involves hybridizing the separated strands with
primers that flank the target sequence. The primers are then extended to form complementary copies of the
target strands, and the cycle of denaturation, hybridization, and extension is repeated as many times as
necessary to obtain the desired amount of amplified nucleic acid.
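
To give a sense of why repeating the cycle yields the desired amount of product, the idealized arithmetic can be sketched as follows. This is a simplified model and not part of the specification: the function name `pcr_copies` and the `efficiency` parameter are assumptions introduced for illustration, and real reactions fall short of perfect doubling and eventually plateau.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized number of target copies after a given number of PCR cycles.

    efficiency is the assumed fraction of templates copied per cycle:
    1.0 models perfect doubling, 0.0 models no amplification at all.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    # Each cycle multiplies the copy number by (1 + efficiency).
    return initial_copies * (1.0 + efficiency) ** cycles


# Under perfect doubling, one template molecule grows by a factor of 2**30 in 30 cycles.
print(pcr_copies(1, 30))
```

Under this model even a single starting molecule becomes detectable after a few dozen cycles, which is consistent with the statement elsewhere in the text that very small samples can be typed.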

As noted above, the present invention also provides novel PCR primers for HLA-DP DNA amplification
and typing. These primers are complementary to sequences in the conserved regions that flank the target
sequences in the variant regions of the HLA-DP loci. For purposes of the present invention, the preferred
variant region of the HLA-DP loci is the second exon of the DPA1 and DPB1 genes. For successful PCR
amplification, the present primers are designed so that the position at which each primer hybridizes along a
duplex sequence is such that an extension product synthesized from one primer, when it is separated from
its template (complement), serves as a template for the extension of the other primer to yield an amplified
segment of nucleic acid of defined length. Moreover, primers are provided that will bind preferentially to the
HLA-DP region under selective annealing conditions. Preferred primers are

UG19 SEQ ID NO: 144 (GCTGCAGGAGAGTGGCGCCTCCGCTCAT) and 
UG21 SEQ ID NO: 145 (CGGATCCGGCCCAAAGCCCTCACTC), 

Template-dependent extension of primers in PCR is catalyzed by a polymerizing agent in the presence
of adequate amounts of four deoxyribonucleoside triphosphates (dATP, dCTP, dGTP, and dTTP or dUTP) in
a reaction medium comprised of the appropriate salts, metal cations, and pH buffering system. Suitable
polymerizing agents are enzymes known to catalyze template-dependent DNA synthesis. For example, if
the template is RNA, a suitable polymerizing agent to convert the RNA into a complementary DNA (cDNA)
sequence is reverse transcriptase (RT), such as avian myeloblastosis virus RT. Once the target for
amplification is DNA, suitable polymerases include, for example, E. coli DNA polymerase I or its Klenow
fragment, T4 DNA polymerase, and Taq polymerase, a heat stable DNA polymerase isolated from Thermus
aquaticus and commercially available from Perkin-Elmer (Norwalk, CT). The latter enzyme is widely used in
the amplification and sequencing of nucleic acids. The reaction conditions for using DNA polymerases are
known in the art, and are described in, for example, the treatise Methods in Enzymology, and in Maniatis et
al., Molecular Cloning: A Laboratory Manual, supra.

The PCR method can be performed in a step-wise fashion, where after each step new reagents are
added, or in a fashion where all of the reagents are added simultaneously, or in a partial step-wise fashion,
where fresh or different reagents are added after a given number of steps. For example, if strand separation
is induced by heat, and the polymerase is heat-sensitive, then the polymerase will have to be added after
every round of strand separation. However, if, for example, a helicase is used for denaturation, or if a
thermostable polymerase is used for extension, then all of the reagents may be added initially, or,
alternatively, if molar ratios of reagents are of consequence to the reaction, the reagents may be
replenished periodically as they are depleted by the synthetic reaction.

Due to the enormous amplification possible with the PCR process, small amounts of DNA carried over
from other samples, positive control templates, or from previous amplifications can provide enough template
to result in PCR product, even in the absence of added template DNA. If possible, all reaction mixes are set
up in an area separate from PCR product analysis and sample preparation. The use of dedicated or
disposable vessels, solutions, and pipettes (preferably positive displacement pipettes or plugged pipette
tips) for RNA/DNA preparation, reaction mixing, and sample analysis will minimize cross contamination. See
also Higuchi and Kwok, 1989, Nature 339:237-238, and Kwok and Orrego, in Innis et al. eds., 1990, PCR
Protocols: A Guide to Methods and Applications, Academic Press, Inc., San Diego, CA, which are
incorporated herein by reference.

One particular method for minimizing the effects of cross contamination of nucleic acid amplification is
described in International Patent Application, Publication No. WO 92/01814, which is incorporated herein by
reference. The method involves the introduction of unconventional nucleotide bases, such as dUTP, into the
amplified product. Exposure of the amplification product to enzymatic and/or physical-chemical treatment
renders the product DNA incapable of serving as a template for subsequent amplifications. For example,
uracil-DNA glycosylase will remove uracil residues from PCR product containing that base. Enzyme
treatment of a PCR reaction mixture prior to amplification results in the degradation of any contaminating
uracil-containing PCR product from a prior reaction and serves to "sterilize" the amplification reaction.

It is preferable, but not essential, to add the thermostable DNA polymerase to the reaction mix after
both the primer and the template are added. By separating at least one component that is essential for
primer extension, the initiation of polymerization can be controlled and non-specific primer hybridization and
extension are minimized. Polymerization initiation can also be controlled by delaying the addition of MgCl2. A
modification of PCR referred to as "hot start" is described in International Patent Application, Publication
No. WO 91/12342, which is incorporated herein by reference. In hot start PCR, the addition of polymerase
is delayed until the initial high-temperature denaturation step, thereby minimizing the formation of extension
products from non-specific primer hybridization which may occur if all reaction components are added at
room temperature.

Those skilled in the art will know that the PCR process is usually carried out as an automated process
with a thermostable enzyme. In this process, the reaction mixture is cycled through a denaturing region, a
primer annealing region, and a reaction region. A machine specifically adapted for use with a thermostable
enzyme is disclosed more completely in European Patent Application, Publication No. 236,069, incorporated
herein by reference, and is commercially available from Perkin Elmer (Norwalk, CT).

One reason, as noted above, the PCR process is important in the method of the present invention is
that the PCR process can be used to amplify the sample nucleic acid prior to HLA-DP DNA typing. Another
important use of PCR for purposes of the present invention, however, is for determining the nucleotide
sequence of previously undiscovered allelic variants which exist in the HLA-DP region, so that probes for
those variants can be constructed and used in the present method. In this use of the PCR process,
polymorphic regions of the DPA1 and DPB1 genes are amplified, and the nucleotide sequences of these
polymorphic target regions, for example, the second exon of the DPA1 and DPB1 genes, are determined.
As illustrated below, it is also useful for the cells containing a particular variant to be typed by serological
typing, mixed lymphocyte typing, or primed lymphocyte typing to correlate the nucleotide sequence of a
particular variant with the DP type established by prior art methods.

Analysis of the nucleotide sequence of the target region of a DP variant allele can be readily performed
by direct analysis of the PCR products. A preferred sequencing protocol is described in Innis et al., 1988,
Proc. Natl. Acad. Sci. 85:9436-9440, and U.S. Patent No. 5,075,216, incorporated herein by reference. A
process for direct sequence analysis of PCR amplified products is also described by Saiki et al., 1988,
Science 239:487-491. Alternatively, the amplified target sequence may be cloned prior to sequence
analysis, as described by Scharf et al., 1986, Science 233:1076-1078.

As discussed in WO 89/11547, a panel of DPw typed cells, representing a large number of different
haplotypes, has been analyzed by PCR and nucleotide sequencing of the second exon of the DPB1 gene.
As a result of this effort and similar efforts using samples obtained from a variety of individuals suffering
from autoimmune diseases, a large number of different allelic variants in this locus were discovered. In
general, the results demonstrated that specific DPB1 sequences correlate with the standard PLT-defined
DPw1 through DPw6 specificities. The rare exceptions may reflect the difficulty of obtaining a standard and
reproducible PLT DPw typing system, which difficulties highlight the advantages of the present method. In
this regard, it is relevant that the cell line Cox, originally typed as DPw1, has been retyped as DPw3 and
contains the DPB3 allele (now DPB1*0301; the new nomenclature is listed elsewhere).

Thus, the present invention also relates to a process for determining an individual's susceptibility to an
autoimmune disease comprising determining the individual's HLA DP genotype in accordance with the
present invention and determining whether the individual's genotype is a genotype which is linked to an
autoimmune disease. Said autoimmune disease is either linked to the DPB2.1 allele, such as in the case of
pauciarticular juvenile rheumatoid arthritis and insulin dependent diabetes mellitus (IDDM), or to an allele
selected from the group consisting of DPB13, DPB1, DPB3, and DPB4.2, such as in the case of coeliac
disease (CD).

For other interesting and important correlations between the serologically defined DP types and DP 
variant nucleotide sequences and for descriptions of how to perform HLA-DNA typing on the DPA1 
(previously DPalpha) locus and on the DPB1 (previously DPbeta) locus see WO 89/11547. 

The DNA sequences of the DPA1 and DPB1 genes serve as a useful starting point in the design of the 
sequence specific oligonucleotide probes of the present invention. These probes are designed so that, 
under stringent hybridization conditions, the probes hybridize specifically only to exactly complementary 
sequences in variant segments of the DPA1 and DPB1 alleles. These SSO probes may be of any length 
which spans the variant sequences in a variant region and allows for sequence specific hybridization, but 
preferably the hybridizing region of the probe is short, in the range of 10 to 30 bases, and more preferably 
is about 17 to 19 bases in length. For immobilization, the probe may also contain long stretches of poly-T 
which can be fixed to a solid support by irradiation, a technique described in more detail in International
Patent Application, Publication No. WO 89/11548 and in European Patent Application, Publication No.
237,362. The disclosures of these applications are incorporated herein by reference.
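
The length preference stated above can be checked mechanically for any candidate probe. The sketch below does so for three hybridizing regions copied from the probe lists, and also reports a rough duplex stability estimate using the Wallace rule (2 degrees C per A or T, 4 degrees C per G or C). The Wallace rule is a general rule of thumb for short oligonucleotides and is an assumption added here; it is not the hybridization condition used in this specification, and the function names are illustrative.

```python
def wallace_estimate(seq: str) -> int:
    """Rule-of-thumb dissociation temperature in degrees C for a short oligonucleotide."""
    s = seq.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2 * at + 4 * gc


def within_preferred_length(seq: str, low: int = 10, high: int = 30) -> bool:
    """True if the hybridizing region falls in the 10 to 30 base range named in the text."""
    return low <= len(seq) <= high


# Hybridizing regions copied from the probe lists in this specification.
probes = {
    "SEQ ID NO: 92": "CCTGATGAGGTGTACTG",
    "SEQ ID NO: 99": "GAATTACCTTTTCCAGGGA",
    "SEQ ID NO: 126": "CTGGACGAGGCCGTG",
}

for name, seq in probes.items():
    print(name, len(seq), within_preferred_length(seq), wallace_estimate(seq))
```

Comparing such estimates across a panel gives only a first indication of whether probes of different length and base composition might behave similarly under one set of wash conditions; actual conditions would need to be determined empirically.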

The SSO probes of the invention are also designed to hybridize specifically with a particular variant 
segment of a DP allele and to have destabilizing mismatches with the other variant sequences known for 
the particular segment. Preferably, the probes are specific for variant DNA segments in the variable second 
exons of the DPA1 and DPB1 genes, and even more preferably, the probes are specific for DNA segments 
encoding the residues near positions 8-11, 36, 55-57, 65-69, 76, and 84-87 of the second exon.
Oligonucleotide probes which have been designed to hybridize specifically to the second exons of the 
DPB1 and DPA1 alleles are described in more detail below and in the Examples. 

The probes of the invention can be synthesized and labeled using the techniques described above in
the discussion of PCR primers. For example, the probe may be labeled at the 5'-end with ³²P by incubating
the probe with ³²P-ATP and kinase. A suitable non-radioactive label for SSO probes is horseradish
peroxidase (HRP). Methods for preparing and detecting probes containing this label are described in the
Examples below and in European Patent Application, Publication No. 237,362, incorporated herein by
reference. For additional information on the use of such labeled probes, see U.S. Patent No. 4,789,630;
Saiki et al., 1988, N. Eng. J. Med. 319:537-541; and Bugawan et al., 1988, Bio/Technology 6:943-947,
incorporated herein by reference. Useful chromogens include red leuco dye and tetramethyl benzidine
(TMB).

The probes of the invention can be used to identify the allelic sequences present in a sample by 
determining which of the SSO probes bind to the HLA-DP sequences present in the sample. Suitable assay 
methods for purposes of the present invention to detect hybrids formed between SSO probes and nucleic 
acid sequences in a sample are known in the art. For example, the detection can be accomplished using a 
dot blot format, as described in the Examples. In the dot blot format, the unlabeled amplified sample is 
bound to a membrane, the membrane incubated with labeled probe under suitable hybridization conditions, 
unhybridized probe removed by washing, and the filter monitored for the presence of bound probe. When 
multiple samples are analyzed with few probes, a preferred process requires high stringency hybridization
and wash conditions, allowing only perfectly matched hybrids to exist. 

An alternative method is a "reverse" dot blot format, in which the amplified sequence contains a label. 
In this format, the unlabeled SSO probes are bound to the membrane and exposed to the labeled sample 
under appropriately stringent hybridization conditions. Unhybridized labeled sample is then removed by
washing under suitably stringent conditions, and the filter is then monitored for the presence of bound 
sequences. 

In another version of the "reverse" dot blot format, the SSO probe is labeled, and the sample nucleic
acid is unlabeled. After hybridization and washing, the labeled probe, or a labeled fragment of the probe, is
released from the membrane and detected to determine if a sequence in the sample hybridized to the
labeled oligonucleotide. Release of label can be achieved by digestion with a restriction enzyme that
recognizes a restriction site in the duplex hybrid. This procedure, known as oligomer restriction, is
described more fully in U.S. Patent No. 4,683,194 and corresponding EP Patent Publication 164,054, each
of which is incorporated herein by reference.

Whatever the method for determining which DP SSO probes of the invention hybridize to DP
sequences in a sample, the central feature of the DP-DNA typing method involves the identification of the
HLA-DP alleles present in the sample by analyzing the pattern of binding of a panel of SSO probes.
Although single probes of the invention can certainly be used to provide useful information, the variation in
the DPB1 alleles is dispersed in nature, so rarely is any one probe able to identify uniquely a specific DP
variant. Rather, as shown in the Examples, the identity of an allele is inferred from the pattern of binding of
a panel of SSO probes, which are specific to different segments of the DPA1 and DPB1 genes.
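
The inference from a binding pattern can be sketched as a lookup over expected probe reactivities. The example below is a toy model under stated assumptions: the allele names, probe names, and reactivity table are hypothetical placeholders and do not reproduce the actual probe-to-allele correspondences of this specification, and the sample is assumed to carry at most two alleles, so that the observed pattern is the union of two allele patterns.

```python
from itertools import combinations_with_replacement
from typing import Dict, FrozenSet, Iterable, List, Tuple

# Hypothetical reactivity table: the probes each allele is expected to bind.
ALLELE_PROBES: Dict[str, FrozenSet[str]] = {
    "allele_X": frozenset({"p1", "p3"}),
    "allele_Y": frozenset({"p2", "p3"}),
    "allele_Z": frozenset({"p1", "p4"}),
}


def candidate_genotypes(observed: Iterable[str]) -> List[Tuple[str, str]]:
    """Return all allele pairs whose combined expected pattern equals the observed one."""
    target = frozenset(observed)
    matches = []
    # Homozygous pairs (a, a) are included by combinations_with_replacement.
    for a, b in combinations_with_replacement(sorted(ALLELE_PROBES), 2):
        if ALLELE_PROBES[a] | ALLELE_PROBES[b] == target:
            matches.append((a, b))
    return matches


# A sample positive for p1, p2 and p3 is explained only by allele_X with allele_Y.
print(candidate_genotypes({"p1", "p2", "p3"}))
```

Note that no single probe in this toy table identifies an allele by itself (p1 and p3 are each shared by two alleles), which mirrors the point made above about dispersed variation. If more than one pair is returned, the panel cannot resolve the genotype and additional probes would be needed.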

DNA typing of HLA-DP alleles is useful for many different purposes, e.g. for determining an individual's
susceptibility to autoimmune diseases linked to certain HLA-DP alleles, as discussed in detail in WO
89/11547 and below. It is anticipated that as medical technology develops, more disease or disease-prone
states, including Graves' disease, S.L.E., and Sjogren's Syndrome, will become known to be associated
with various DP alleles. The present invention provides methods for distinguishing such alleles from other
alleles and so provides a means to identify individuals at high risk for an autoimmune disease. In a
preferred embodiment, an individual whose susceptibility is to be determined is analyzed for HLA-DP type
first by using the PCR method to amplify the target region of the HLA-DP locus. Then, SSO probes are
hybridized to the amplified target region, and the particular DP allele present in the amplified DNA is
determined from the pattern of binding of the SSO probes. Finally, one determines whether the allele
present in the amplified DNA is an allele associated with the autoimmune disease.

The present method, however, is not limited to the field of medical science in its ability to provide
significant benefits. DNA typing methods also now play a significant role in the important area of individual
identification, whether for solving crimes, as when the identity of a criminal or victim is established by
linking an individual with evidence left at the scene of a crime, or for solving other issues of a non-criminal
nature, as when biological material is used to determine the maternity or paternity of an individual. Thus,
the present invention also relates to a process of providing forensic evidence concerning the derivation of a
sample which contains genomic nucleic acids comprising determining the HLA DP genotypes of the sample
and of a suspected individual, comparing the HLA DP genotypes of the individual and of the sample, and
deducing whether the sample could have been derived from the individual.
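
The comparison step described above amounts to asking whether two genotypes contain the same alleles regardless of order. A minimal sketch follows; the function name is an assumption introduced for illustration, each genotype is represented as a pair of allele designations, and the outcome is framed as inclusion or exclusion only, since a shared genotype does not by itself establish identity.

```python
from typing import Sequence


def could_derive_from(sample_genotype: Sequence[str], individual_genotype: Sequence[str]) -> bool:
    """True if the sample and the individual carry the same set of alleles (order ignored).

    A True result means the individual is not excluded as the source;
    it does not prove that the sample came from that individual.
    """
    return sorted(sample_genotype) == sorted(individual_genotype)


# Allele designations taken from the allele table of this specification.
print(could_derive_from(("DPB4.1", "DPB2.1"), ("DPB2.1", "DPB4.1")))
print(could_derive_from(("DPB4.1", "DPB4.1"), ("DPB2.1", "DPB4.1")))
```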

Whatever the purpose for which the present invention is employed, the differences between various DP
alleles are key to the success of the method. Amino acid sequences from each of the DP alleles (and the
SXA and SXB pseudoalleles) are provided in the Sequence Listing section; nucleotide sequences are
provided for the DPB1 alleles. Shown below are the sequence identification numbers for the nucleic acid
and amino acid sequences for each of the alleles. Two equivalent designations are given for each of the
DPB1 alleles; the second is the official name assigned by the WHO Nomenclature Committee.



Allele    (WHO)        Amino Acid Sequence
DPA1      DPA1*0101    SEQ ID NO: 1
DPA2      DPA1*0201    SEQ ID NO: 2
SXA                    SEQ ID NO: 3
SXB                    SEQ ID NO: 4

Allele    (WHO)        Nucleotide Sequence    Amino Acid Sequence
DPB1      DPB1*0101    SEQ ID NO: 5           SEQ ID NO: 6
DPB2.1    DPB1*0201    SEQ ID NO: 7           SEQ ID NO: 8
DPB2.2    DPB1*0202    SEQ ID NO: 9           SEQ ID NO: 10
DPB3      DPB1*0301    SEQ ID NO: 11          SEQ ID NO: 12
DPB4.1    DPB1*0401    SEQ ID NO: 13          SEQ ID NO: 14
DPB4.2    DPB1*0402    SEQ ID NO: 15          SEQ ID NO: 16
DPB5      DPB1*0501    SEQ ID NO: 17          SEQ ID NO: 18
DPB6      DPB1*0601    SEQ ID NO: 19          SEQ ID NO: 20
DPB8      DPB1*0801    SEQ ID NO: 21          SEQ ID NO: 22
DPB9      DPB1*0901    SEQ ID NO: 23          SEQ ID NO: 24
DPB10     DPB1*1001    SEQ ID NO: 25          SEQ ID NO: 26
DPB11     DPB1*1101    SEQ ID NO: 27          SEQ ID NO: 28
DPB13     DPB1*1301    SEQ ID NO: 29          SEQ ID NO: 30
DPB14     DPB1*1401    SEQ ID NO: 31          SEQ ID NO: 32
DPB15     DPB1*1501    SEQ ID NO: 33          SEQ ID NO: 34
DPB16     DPB1*1601    SEQ ID NO: 35          SEQ ID NO: 36
DPB17     DPB1*1701    SEQ ID NO: 37          SEQ ID NO: 38
DPB18     DPB1*1801    SEQ ID NO: 39          SEQ ID NO: 40
DPB19     DPB1*1901    SEQ ID NO: 41          SEQ ID NO: 42
DPB20     DPB1*2001                           SEQ ID NO: 43
DPB21     DPB1*2801    SEQ ID NO: 44          SEQ ID NO: 45
DPB22     DPB1*3101    SEQ ID NO: 46          SEQ ID NO: 47
DPB23     DPB1*2701    SEQ ID NO: 48          SEQ ID NO: 49
DPB24     DPB1*3201    SEQ ID NO: 50          SEQ ID NO: 51
DPB25     DPB1*3301    SEQ ID NO: 52          SEQ ID NO: 53
DPB26     DPB1*3401    SEQ ID NO: 54          SEQ ID NO: 55
DPB27     DPB1*2901    SEQ ID NO: 56          SEQ ID NO: 57
DPB28     DPB1*3001    SEQ ID NO: 58          SEQ ID NO: 59
DPB29     DPB1*3501    SEQ ID NO: 60          SEQ ID NO: 61
DPB30     DPB1*2101    SEQ ID NO: 62          SEQ ID NO: 63



The sequence information provided above is repeated in the amino acid and nucleotide sequence 
alignment Tables below in a form which permits easy visual inspection. 

The most significant differences between DP alleles can be detected quite readily when the various
amino acid sequences encoded by the alleles are aligned and examined. Such an alignment is shown
below, where a dash indicates identity with the DPB4.1 allele (for the various DPB1 alleles and the DPB1
pseudogene, designated SXB) or with the DPA1 allele (for the DPA1 allele, DPA2, and the DPalpha
pseudogene, designated SXA). In this depiction, the numbered positions are for the mature peptide
subunits, allele designations are at left, and representative cell sources are at right.

To detect and distinguish between DP alleles in a practicable and economic fashion, however, one must
know the nucleotide sequence of the alleles. Portions of the nucleotide sequences of various DPA1 and
DPB1 alleles are shown below; the sequences are identified as above. The illustrative primers of the
invention enable production of DNA from which the sequence of codons 8 to 90 can be determined. The
locations of the allele sequences that are the preferred target sequences for hybridization with the various
probes of the invention are designated as |-A-|, |-B-|, |-C-|, |-D-|, |-E-|, and |-F-|.

The DNA sequences provided above are an important aspect of the present invention. Although only
one strand of the sequence is shown, those of skill in the art recognize that the other strand of the
sequence can be inferred from the information depicted above. This information enables the construction of
the probes of the invention. Many illustrative probes of the invention are shown in the Examples below.
However, suitable SSO probes for hybridization analysis of the DPB1 alleles will comprise (or be
complementary to) certain polymorphic sequences. Six sets of illustrative probes of the invention are
depicted below; each set is designed to distinguish between the polymorphisms in a specific region of the
second exon of the HLA-DPB1 gene. The designation of the regions is as described above. The
polymorphic residues encoded within the allelic variant in the segment to which a probe hybridizes are shown
in one-letter amino acid code (a dash means that the non-polymorphic prototype residue(s) are present in
those positions) to the left of the probe sequence. The probes span the regions encoding the polymorphic
amino acid residues and are shown as having a length of about 18 nucleotides. Those sequences in the
probe that encode the polymorphic amino acid residues, and thus must be included within a probe for
detecting the alleles that encode the designated segment, are between the slash marks in the sequence.
The DP alleles with which the probe will hybridize are shown to the right of the probe.
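
The relationship between a probe sequence and the amino acid code printed to its left can be checked by translating the probe. The following is only a minimal sketch: it assumes that the Segment A probes, as printed, begin at the first base of a codon, and the function names (`translate`, `probe_residues`) are hypothetical helpers, not part of the specification. The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid code."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def probe_residues(probe_with_slashes: str) -> str:
    """Strip the slash marks and translate the probe from its first base.

    Assumption: the probe as printed starts in frame (true for the
    Segment A probes used below, but not guaranteed for every segment).
    """
    return translate(probe_with_slashes.replace("\\", ""))


if __name__ == "__main__":
    # Segment A probes exactly as printed, slash marks included.
    for name, probe in [
        ("LF-G", r"TAC\CTTTTCCAGGG\ACGG"),
        ("VY-L", r"TAC\GTGTACCAGTT\ACGG"),
        ("VH-L", r"TAC\GTGCACCAGTT\ACGG"),
    ]:
        print(name, probe_residues(probe))
```

Under this framing assumption, the residues between the flanking Tyr and Arg codons spell out the four-residue segment, with the dash in the printed code standing for the non-polymorphic Gln.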

HLA-DPB1 SSO Probes

Amino Acid  SEQ ID NO        Probe Sequence          DPB1 Alleles

Segment A:
LF-G        SEQ ID NO: 64    TAC\CTTTTCCAGGG\ACGG    2.1, 2.2, 4.1, 4.2, 5, 8, 16, 19, 21, 22, 24, 25, 26
VY-L        SEQ ID NO: 65    TAC\GTGTACCAGTT\ACGG    3, 6, 11, 13, 20, 23, 27, 30
VY-G        SEQ ID NO: 66    TAC\GTGTACCAGGG\ACGG    1, 15, 18
VH-L        SEQ ID NO: 67    TAC\GTGCACCAGTT\ACGG    9, 10, 14, 17, 28, 29

Segment B:
E-FA        SEQ ID NO: 68    CGG\GAGGAGTTCGC\GCGC    4.1, 21, 22, 25
E-FV        SEQ ID NO: 69    CGG\GAGGAGTTCGT\GCGC    21,3,4^6^8^^ iaHlfikl7,iak la 2a 24 27, 28k 29
E-LV        SEQ ID NO: 70    CGG\GAGGAGCTCGT\GCGC    2.2, 5, 26, 30
Q-YA        SEQ ID NO: 71    CGG\CAGGAGTACGC\GCGC    11, 15
E-YA        SEQ ID NO: 72    CGG\GAGGAGTACGC\GCGC    1, 13, 23

Segment C:
AAE         SEQ ID NO: 74    CCTG\CTGCGGAG\TACTGG    1, 4.1, 11, 13, 15, 22, 23, 25, 26
DEE         SEQ ID NO: 75    CCTG\ATGAGGAG\TACTGG    21, 42 a la 1611^21
EAE         SEQ ID NO: 76    CCTG\AGGCGGAG\TACTGG    225via28v30
DED         SEQ ID NO: 77    CCTG\ATGAGGAC\TACTGG    a 6^811417, 20^ 27, 29

Segment D:
I-K         SEQ ID NO: 78    GAC\ATCCTGGAGGAGAA\G    I, 41,425,1a 2a 29
I-E         SEQ ID NO: 79    GAC\ATCCTGGAGGAGGA\G    2i,22aaiDiiam
L-K         SEQ ID NO: 80    GAC\CTCCTGGAGGAGAA\G    ^142^21,2225
L-E         SEQ ID NO: 81    GAC\CTCCTGGAGGAGGA\G    6127
L-R         SEQ ID NO: 82    GAC\CTCCTGGAGGAGAG\G    11, 15

Segment E:
M           SEQ ID NO: 83    GACAGG\ATG\TGCAGACAC    21, 22 41, 42 5k €i 11,15^16117,14201 21r-2a2a30
V           SEQ ID NO: 84    GACAGG\GTA\TGCAGACAC    i,a7.aaiai2i4 27,29
I           SEQ ID NO: 150   GACAGG\ATA\TGCAGACAC    iai9

Segment F:
GGPM        SEQ ID NO: 85    CTGG\GCGGGCCCA\TGACC    21,2241,4221,25
DEAV        SEQ ID NO: 86    CTGG\ACGAGGCCG\TGACC    1, a 5k 6^ a a mil, la 141a 17, 191 2a 222^27-30
VGPM        SEQ ID NO: 87    CTGG\TCGGGCCCA\TGACC    15k la 21, 26



Because the probes of the invention are single stranded for use in hybridization, it is important to note 
that merely because a probe is designed to hybridize with, for example, the coding strand, does not mean 
that an equally useful probe could not be designed that would hybridize to the complementary sequence 
present on the noncoding strand. 
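
As an illustration of this point, the probe for the opposite strand is simply the reverse complement of the sequence shown. The sketch below is not taken from the specification; `reverse_complement` is a hypothetical helper, and the input is the Segment A "LF-G" sequence (SEQ ID NO: 64) with its slash marks removed.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the sequence of the opposite strand, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    coding_strand_probe = "TACCTTTTCCAGGGACGG"  # SEQ ID NO: 64, slashes removed
    print(reverse_complement(coding_strand_probe))  # CCGTCCCTGGAAAAGGTA
```

The result, CCGTCCCTGGAAAAGGTA, coincides with the first 18 bases of the 22-mer CCGTCCCTGGAAAAGGTAATTC that is listed among the Region A probes in Example 3, which suggests that both strand orientations were in fact used for this region.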

The sequence information provided above also relates to other important aspects of the invention. The
preferred primers of the invention are designed to amplify many different DP alleles. In many instances, as
demonstrated in the Examples below, such primers are very useful. However, those of skill in the art
recognize that the DNA sequence information provided above can also be used to design primers that will
enable allele-specific amplification. Such allele-specific primers will only amplify a single allele or a certain
subset of known alleles. For example, the "DEAV" probe of the invention can be used as one primer of a
primer pair to provide for allele-specific amplification of "DEAV" positive DPB1 alleles.

The present invention also relates to kits useful for determining an individual's HLA-DP genotype,
comprising a panel of SSO probes and possibly suitable primers in accordance with the present invention.
Said kits preferably consist of a multicontainer unit comprising the essential components for practicing the
present method. For example, the kit can contain primers for PCR, as such primers are necessary in the
preferred embodiment of the invention. These primers will amplify at least the DPB1 gene, and, when
appropriate, for example in forensic analysis, primers can be included that also amplify the DPA1 gene. The
kit must also contain SSO probes for at least the DPB1 gene, and, when appropriate, for the DPA1 gene as
well. In some cases, the SSO probes may be fixed to an appropriate support membrane which is useful for
the hybridization analysis. Other optional components that may be contained in containers within the kit
include, for example, an agent to catalyze the synthesis of primer extension products, the substrate
nucleoside triphosphates, means used to label (for example, an avidin-enzyme conjugate and enzyme
substrate and chromogen if the label is biotin), and the appropriate buffers for PCR or hybridization
reactions. In addition to the above components, the kit can also contain instructions for carrying out the
present method.

A number of examples of the present invention, which are provided only for illustrative purposes and
not to limit the scope of the invention, are presented below. Numerous embodiments of the invention within
the scope of the claims that follow the examples will be apparent to those of ordinary skill in the art from
reading the foregoing text and following examples. In the following examples, certain techniques are
standard, unless specifically indicated otherwise. Such techniques include primed lymphocyte typing (PLT),
which was performed essentially as described by Shaw et al., 1980, J. Exp. Med. 152:565-580. Generally,
for lymphocyte priming, responder and stimulator cells were thawed, washed, and resuspended in RPMI-1640
medium supplemented with glutamine and antibiotics (complete media). Responding cells were mixed
with irradiated stimulator cells in a ratio of 2:1, and the cell mixture was incubated for ten days at 37°C. The
primed irradiated responder cells were co-cultured with irradiated stimulator cells in complete media. After
48 hours, 3H-thymidine was added to the culture. The cells were harvested 18 hours later, and tritium
incorporation was evaluated by counting beta-emission.

DNA sequence analysis was performed as described by Maniatis et al., Molecular Cloning: A Laboratory
Manual (New York, Cold Spring Harbor Laboratory, 1982). Generally, the sequences to be analyzed
were cloned into an M13 cloning vector and analyzed by either the Maxam-Gilbert technique or by the
dideoxy chain termination technique. Synthetic oligonucleotides, both primers and probes, were synthesized
using commercially available instruments and techniques well known in the art. The skilled person is in a
position to choose alternative reagents or instruments from other suppliers if necessary. Unless otherwise
specified, percentages given below are on a wt/wt, vol/vol and wt/vol basis, respectively.

Example 1 

Analysis of the DNA Sequences of HLA-DP Alleles

The DNA sequences of the variable second exons of a variety of alleles of the DPA1 gene and of the
DPB1 gene were determined. The DNA samples used were chosen to represent as wide a spectrum of
PLT-defined DP alleles as possible. DNA was extracted from cell lines homozygous for the standard six DPw
types, from cells exhibiting unusual typing reactions, and from cells exhibiting DP blank reactions. DNA
extraction was by standard techniques, as described by Maniatis et al., in Molecular Cloning: A Laboratory
Manual. The variable second exons of the DPA1 and the DPB1 genes were amplified by the PCR method,
as described in Example 2, below. The amplified DNA sequences were cloned into an M13-derived vector,
and the DNA sequences were determined by the chain termination method. The cell lines used, their DR
serotypes, their PLT-defined DPw types, and the DNA-defined alleles they were found to contain are listed
in tabular form below (blank spaces indicate data not determined).

Cell Line   DR Type    DPw Type   DPB1 Alleles   DPA1
CRK         7          1*         1, 11
RSO101      w6         1, 3       1, 3
BMG         4          1, 4       4.1
QBL         3          2          2.2            1
WJR         2          2          2.1
PJM         3          2, 3       2.1, 3
RMD         5, w13     2, 4       2.1, 4.1
JMOS        4, 5       2, 6       2.1, 6
SLE         w13        3          3              1
COX         3          3          3              1
OPR         w8, w10    3, 4       3, 4.1
JAH         4          3, 4       3, 4.1
MRI         1, 2       3, 5       3, 5
HHK         w6         4          4.1
LB1         3, 7       4, MAS     1.4.2
APD         w6         4*         2.4
LUY         w8         1, 4       1, 4.1         1, 2
WDV         w6         2, 4       2.1, 4.1
SMF         2, w12     4, 5       4.1, 5
BCR         4, w6      4, 6       4.1, 6
GTER        w8, 9      5          5
HAS         4          5          5, 19
DKY         9          5          2.1            1
BIN40       4          3, 6       3, 6           1
LG2         1                     4.1
PIAZ        2, 7                  8
TDK         2                     9              2
BM21        w11                   10
VLA         2, 3                  11
GBA         1, 7                  11
CD1         3, 4       3, 6       10
CD2         7, (5)     2, 4       4.2, 10
CDS         3, 7       4          4.1, 4.2
CD11                   2          2.1, 4.2

DP "MAS" is a provisional designation for a newly defined DP specificity related to the DPB4.2 allele.
"CD" cell lines actually are samples from CD patients.
* - refers to cells with unusual DPw phenotypes.



As shown from the foregoing, the DNA analysis of the HLA-DP subtypes shows that specific sequences
correlate with the known PLT-defined DPw typings, indicating that the polymorphic epitopes recognized by
the primed T cells are on the DPB1 chain. For some DP types, e.g., DPw2 and DPw4, sequence analysis
has revealed subtype variants. The variants for DPw2 have been designated DPB2.1 and DPB2.2; cells
typed by PLT as DPw4 "new" (e.g., LB1) or DPw4* (e.g., APD) contain the rarer DPB4.2 subtype. The
DPB4.2 subtype is more related by sequence to the DPB2.1 allele than to the DPB4.1 allele. Individual
CD11 is PLT typed as DPw2, but contains the closely related DPB2.1 and DPB4.2 alleles.

The results above also show that unique DPB1 sequences correspond to the DPw1, DPw3, DPw5, and
DPw6 specificities, and these alleles have been designated to reflect this correlation. There are a few
exceptions, however: cell line DKY has been typed as DPw5, but contains the DPB2.1 allele, and individual
CD2 is PLT typed as DPw2, but contains the DPB4.2 and DPB10 alleles.

Example 2

Primers for the PCR Amplification of the DPA1 and DPB1 Genes

The DPA1 and DPB1 genes of some of the cells described in Example 1 were amplified by PCR. The
primers used, which were synthetic, along with the regions of the genes to which the primers bind, and
which will act as templates for primer extension, are shown below. As depicted in the table, the left side
primers, GH98 and DB01, are from the upper strand, and direct DNA polymerase to extend rightward. The
right side primers, GH99 and DB03, are from the lower strand, and direct synthesis leftward. Lower case
letters indicate bases in the primer that are not complementary to the target genomic DNA (shown in the
opposite strand). These changes in the primers incorporate restriction enzyme sites (BamHI or PstI) at the
ends of the amplified DNA and facilitate cloning of the amplified DNA. The oligonucleotide primers GH98
and GH99, which are used for the amplification of the second exon of DPA1, amplify a 243 bp segment.
The first two bp of the PCR product are from the intervening sequence which flanks the exon. The
oligonucleotide primers DB01 and DB03 amplify a 294 bp segment of the second exon of DPB1. The left 13
bp and the right 17 bp of the product are from the intervening sequence. The genomic sequences to which
the primers bind are listed in the Sequence Listing section under the following sequence identifiers:
Region to which GH98 binds - SEQ ID NO: 146
Region to which GH99 binds - SEQ ID NO: 147
Region to which DB01 binds - SEQ ID NO: 148
Region to which DB03 binds - SEQ ID NO: 149

The sequence of the region between the primer hybridization regions depends on the allele amplified. Allele
sequences are provided in the Sequence Listing section under the sequence identifiers listed above.
DPalpha Primers

15
GH98 >                       Phe Val Gln Thr His Arg Pro Thr . .
cGCGGAtCcTGTGTCAACTTATGCCGCG TTT GTA CAG ACG CAT AGA CCA ACA
TCGCCTGGTACACAGTTGAATACGGCGC AAA CAT GTC TGC GTA TCT GGT TGT

70
. . Leu Asn Asn Asn Leu Asn Thr Leu Ile
. . TTG AAC AAC AAC TTG AAT ACC TTG ATC CAGCGTTCCAACCACACTCAGGCCAC
. . AAC TTG TTG TTG AAC TTA TGG AAC TAG GTCGCAAGGTTGGTGTGAcgtCGGTc
                                                          < GH99

                             10                  15
DB01 >                Leu Phe Gln Gly Arg Gln Glu Cys Tyr
CagggatCCGCAGAGAATTAC CTT TTC CAG GGA CGG CAG GAA TGC TAC
GGGGAGGGGCGTCTCTTAATG GAA AAG GTC CCT GCC GTC CTT ACG ATG

                85                  90
. . Glu Leu Gly Gly Pro Met Thr Leu Gln
. . GAG CTG GGC GGG CCC ATG ACC CTG CAG CGCCGAGGTGAGTGAGGGCTTTGG
. . CTC GAC CCG CCC GGG TAC TGG GAC GTC GCGGCTCCACTCACTgaCGtcctg
                                                          < DB03



Hybridization of the primers and the synthesis of the elongated primer-containing products were essentially
as described in European Patent Application, Publication No. 258,017 and in the protocols provided by the
manufacturer, Perkin Elmer (Norwalk, CT), of the Thermal Cycler used to perform the PCR. Amplification
was for 28 cycles; however, more, i.e., 35, cycles can yield better results.

Two other primers for amplifying the second exon of DPB1 alleles, designated UG19 and UG21, have
proven to be more efficient and specific than the primers discussed above. For amplification using UG19
and UG21, between 0.1 and 1 µg of genomic DNA is amplified in a 200 µl reaction containing 50 mM KCl,
10 mM Tris-HCl (pH 8.4), 1.5 mM MgCl2, 100 µg/ml gelatin, 175 µM each dATP, dCTP, dGTP, and dTTP,
0.50 µM each amplification primer, and 5.0 units of Taq DNA polymerase. The amplification is carried out in
a DNA Thermocycler (Perkin Elmer [Norwalk, CT]) using a two-step temperature cycle (denaturation: 95°C
for 30 seconds; annealing and extension: 65°C for 30 seconds) for 30 cycles.
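
The concentrations above translate into absolute amounts per 200 µl reaction by simple arithmetic (µM x µl = pmol). The sketch below only restates the figures given in the text; the helper name `amount_pmol` is hypothetical, and the cycling time ignores ramp times between temperatures, which the text does not specify.

```python
def amount_pmol(concentration_uM: float, volume_ul: float) -> float:
    """Amount of solute in pmol, since 1 µM x 1 µl = 1 pmol."""
    return concentration_uM * volume_ul


REACTION_UL = 200.0  # reaction volume stated in the text

# Each amplification primer at 0.50 µM.
primer_pmol = amount_pmol(0.50, REACTION_UL)

# Each dNTP at 175 µM; MgCl2 at 1.5 mM (= 1500 µM). Converted to nmol.
dntp_nmol = amount_pmol(175.0, REACTION_UL) / 1000
mgcl2_nmol = amount_pmol(1500.0, REACTION_UL) / 1000

# Two 30-second steps per cycle for 30 cycles, ramp times not included.
cycling_minutes = 30 * (30 + 30) / 60

if __name__ == "__main__":
    print(f"primer: {primer_pmol:g} pmol each")
    print(f"dNTP:   {dntp_nmol:g} nmol each")
    print(f"MgCl2:  {mgcl2_nmol:g} nmol")
    print(f"hold time over 30 cycles: {cycling_minutes:g} min")
```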

The sequences of the DPA1 and DPB1 primers discussed are provided below and in the sequence
listing section.



DPA1 Primers

Primer   Seq Id No.        Sequence
GH98     SEQ ID NO: 140    CGCGGATCCTGTGTCAACTTATGCCGC
GH99     SEQ ID NO: 141    CTGGCTGCAGTGTGGTTGGAACGC

DPB1 Primers

Primer   Seq Id No.        Sequence
DB01     SEQ ID NO: 97     CAGGGATCCGCAGAGAATTAC
DB03     SEQ ID NO: 98     GTCCTGCAGTCACTCACCTCGGCG
UG19     SEQ ID NO: 144    GCTGCAGGAGAGTGGCGCCTGGGCTCAT
UG21     SEQ ID NO: 145    GGGATGCGGGCCAAAGCCGTCAGTG



Example 3

SSO Probes for Hybridization Analysis of DPB1 Alleles

Illustrative probes of the invention referred to throughout the remainder of the Examples are described
below in tabular form. In the Table, the probe designation, sequence identification number, probe sequence
(5' to 3'), and the region and polymorphic amino acid sequence encoded in the region of the allele to which
the probe hybridizes are provided. Probes are shown either as unlabeled or as having a 32P or "X" label,
where X represents HRP or biotin, as discussed in the Examples below. Where a probe sequence is
indicated by an X followed by a probe designation, the sequence of the probe is identical to that of the
probe designated after the X, except that the label has been replaced by an HRP label. As discussed in
a later Example, the probes may be used in a reverse dot blot format in which the unlabeled probe is
immobilized on a membrane and hybridized to the labeled PCR product. Probes shown below as unlabeled
were designed for use in the reverse dot blot format; labeled versions could be used in other formats.
Probes DB154 and DB155 each contain an inosine base which is denoted as "N" in the sequence.

SSOs for HLA-DPB1 Typing

           Probe    Seq Id No.        Sequence

Region A
LFQG       DB10     SEQ ID NO: 99     32P-GAATTACCTTTTCCAGGGA
           DB27     SEQ ID NO: 99     X-DB10
           DB70     SEQ ID NO: 119    X-GAATTACCTTTTCCAGGGAC
           DB136    SEQ ID NO: 137    CCGTCCCTGGAAAAGGTAATTC
VYQL       DB11     SEQ ID NO: 100    32P-ATTACGTGTACCAGTTACG
           DB23     SEQ ID NO: 111    X-CGTAACTGGTACACGTAAT
           DB28     SEQ ID NO: 100    X-DB11
           DB36     SEQ ID NO: 111    X-DB23
           DB58     SEQ ID NO: 114    X-ATTACGTGTACCAGTTA
VYQG       DB12     SEQ ID NO: 101    32P-CGTCCCTGGTACACGTAAT
           DB29     SEQ ID NO: 101    X-DB12
           DB71     SEQ ID NO: 120    X-TTACGTGTACCTGGGAC
VHQL       DB22     SEQ ID NO: 110    32P-ATTACGTGCACCAGTTACG
           DB35     SEQ ID NO: 110    X-DB22
           DB117    SEQ ID NO: 132    ATTACGTGCACCAGTTAC
           DB118    SEQ ID NO: 133    ATTACGTGCACCAGTTA

Region B
EEFARF     AB117    SEQ ID NO: 151    X-AGGAGTTCGCGCGCTT
EEFVRF     AB124    SEQ ID NO: 152    X-AGGAGTTCGTGCGCTT
EELVRF     AB119    SEQ ID NO: 153    X-AGGAGCTCGTGCGCTTC
QEYARF     AB120    SEQ ID NO: 154    X-CCGGCAGGAGTACGCGC
EEYARF     AB121    SEQ ID NO: 155    X-GAGGAGTACGCGCGCT

Region C
AAE        DB13     SEQ ID NO: 102    32P-CCTGCTGCGGAGTACTG
           DB30     SEQ ID NO: 102    X-DB13
DEE        DB14     SEQ ID NO: 103    32P-CAGTACTCCTCATCAGG
           DB31     SEQ ID NO: 103    X-DB14
           DB75     SEQ ID NO: 124    X-CCTGATGAGGAGTACTG
           DB101    SEQ ID NO: 131    CCAGTACTCCTCATCAGGC
           DB121    SEQ ID NO: 134    CAGTACTCCTCATCAG
EAE        DB16     SEQ ID NO: 104    32P-CAGTACTCCGCCTCAGG
           DB32     SEQ ID NO: 104    X-DB16
           DB59     SEQ ID NO: 115    X-CCTGAGGCGGAGTACTG
DED        DB17     SEQ ID NO: 105    32P-CCTGATGAGGACTACTG
           DB33     SEQ ID NO: 105    X-DB17
           DB122    SEQ ID NO: 135    CTGATGAGGACTACTG
DEV        AB112    SEQ ID NO: 92     X-CCTGATGAGGTGTACTG

Region D
I-K        DB18     SEQ ID NO: 106    32P-GACATCCTGGAGGAGAAGC
           DB34     SEQ ID NO: 106    X-DB18
           DB72     SEQ ID NO: 121    X-ACATCCTGGAGGAGAAGC
I-E        DB19     SEQ ID NO: 107    32P-GCTCCTCCTCCAGGATGTC
           DB37     SEQ ID NO: 107    X-DB19
           DB73     SEQ ID NO: 122    X-ACATCCTGGAGGAGGAGC
L-K        DB20     SEQ ID NO: 108    32P-GACCTCCTGGAGGAGAAGC
           DB38     SEQ ID NO: 108    X-DB20
           DB74     SEQ ID NO: 123    X-ACCTCCTGGAGGAGAAGC
L-E        DB21     SEQ ID NO: 109    32P-GCTCCTCCTCCAGGAGGTC
           DB39     SEQ ID NO: 109    X-DB21
           DB62     SEQ ID NO: 116    X-GACCTCCTGGAGGAGGAG
           DB92     SEQ ID NO: 127    X-GACCTCCTGGAGGAGGAGC
           DB155    SEQ ID NO: 139    GACCTCCTGGAGNGAGGAGC
L-R        DB63     SEQ ID NO: 117    X-GACCTCCTGGAGGAGAGG
           DB93     SEQ ID NO: 128    X-GACCTCCTGGAGGAGAGGC
           DB154    SEQ ID NO: 138    GACCTCCTGNGAGGAGAGGC

Region E
M          AB96     SEQ ID NO: 88     TGTCTGCACATCCTGTCCG
V          AB97     SEQ ID NO: 89     TGTCTGCATACCCTGTCCG
I          AB98     SEQ ID NO: 90     CGGACAGGATATGCAGACA

Region F
GGPM/VGPM  DB25     SEQ ID NO: 112    32P-CTGCAGGGTCATGGGCCCCCG
           DB40     SEQ ID NO: 112    X-DB25
           DB64     SEQ ID NO: 118    X-CTGGTCGGGCCCATGACC
           DB76     SEQ ID NO: 125    X-CTGGGCGGGCCCATG
           DB94     SEQ ID NO: 129    X-AGCTGGGCGGGCCCATGAC
GGPM       AB122    SEQ ID NO: 156    X-CGAGCTGGGCGGGCCCA
VGPM       AB123    SEQ ID NO: 157    X-CGAGCTGGTCGGGCCCA
DEAV       DB26     SEQ ID NO: 113    32P-CTGCAGGGTCACGGCCTCGTC
           DB41     SEQ ID NO: 113    X-DB26
           DB95     SEQ ID NO: 130    X-AGCTGGACGAGGCCGTGAC
           DB77     SEQ ID NO: 126    X-CTGGACGAGGCCGTG
"ALL"      DB123    SEQ ID NO: 136    X-CGCTTCGACAGCGACGT

With reference to the Table above, one should note that because DB28 cross-hybridizes with DB32,
superior results can be obtained using probes DB58 and DB59 in place of DB28 and DB32, respectively. In
addition, probe DB63 is preferred over DB93. Specific probe panels for use in an HLA-DP typing assay are
discussed in the Examples which follow. The "ALL" probe, DB123, is specific for a nonvariable sequence
just 3' of region B and hybridizes to all DPB1 allelic sequences. It is useful as a control for the amount of
DPB1 DNA in the hybridization reaction.

Those of skill in the art recognize that depending on the type of label used and the hybridization format
used, hybridization and wash conditions will differ. Although, in a preferred embodiment, the probes will be
labeled nonisotopically (e.g., with HRP or biotin), some of the probes have been used with isotopic (e.g.,
32P) labels. Hybridization and wash conditions for 32P-labeled and HRP-labeled probes used in a dot blot
format are shown below, where such conditions have been empirically determined (see Bugawan et al.,
1988, J. Immunol. 141(12):4024-4030, incorporated herein by reference). In the Table, the conditions
referred to assume a hybridization solution composed of 5 x Denhardt's solution, 0.5% SDS, and the
indicated amounts (i.e., 0.1 x, 3 x, 5 x) of SSPE. Five x Denhardt's solution contains 0.5 g Ficoll, 0.5 g
polyvinylpyrrolidone, 0.5 g BSA (Pentax Fraction V) per 500 ml. The wash solution contains 0.1 x SSPE and
0.1% SDS (for HRP-labeled probes, 0.1% Triton X-100 was used in place of SDS); the wash step is carried
out at the indicated temperature (Celsius) for ten minutes in either a water bath or an air incubator. As
described below, tetramethyl ammonium chloride, or similar salts, can be used to allow for more
uniform hybridization and wash conditions, a preferred condition when a number of probes are used in a
panel to determine the types of DP alleles in a sample.

Probe Hybridization/Wash Conditions

Probe    Hybridization/Wash Conditions
DB10     5 x @ 50/42 H2O bath
DB27     5 x @ 50/42 air
DB11     3 x @ 42/42 H2O air
DB28     3 x @ 55/42 H2O bath
DB58     3-5 x @ 42/42 air
DB23     5 x @ 50/42 H2O bath
DB36     5 x @ 50/42 air
DB12     5 x @ 50/42 H2O bath
DB29     5 x @ 50/42 H2O bath
DB22     3 x @ 50/42 H2O bath
DB35     3 x @ 50/42 H2O bath
DB13     3 x @ 50/42 H2O bath
DB30     3 x @ 50/42 H2O bath
DB14     5 x @ 42/42 H2O bath
DB31     5 x @ 42/42 H2O bath
DB16     5 x @ 42/42 H2O bath
DB32     3-5 x @ 42/42 H2O bath
DB59     5 x @ 50/42 H2O bath
DB17     5 x @ 50/42 H2O bath
DB33     5 x @ 50/42 air
DB18     3 x @ 55/42 H2O bath
DB34     3 x @ 55/42 H2O bath
DB19     5 x @ 55/42 H2O bath
DB37     5 x @ 55/42 H2O bath
DB20     5 x @ 55/42 H2O bath
DB38     5 x @ 55/42 H2O bath
DB21     5 x @ 50/42 H2O bath
DB39     5 x @ 50/42 air
DB62     3 x @ 50/42 H2O bath
DB63     3 x @ 50/42 H2O bath
DB25     5 x @ 50/42 H2O bath
DB40     3 x @ 50/42 H2O bath
DB26     5 x @ 50/42 H2O bath
DB41     3 x @ 50/42 H2O bath

When tetramethyl ammonium chloride (TMACL) is present in a hybridization solution, probe discrimination
is based on probe length and not on the G, C, A, or T composition of the probe. Thus, by using TMACL
in the hybridization solution, one can hybridize and wash many different probes of the same length at a
single temperature.

A suitable hybridization solution for this purpose contains 3 M TMACL; 0.5% SDS; 10 mM Tris-HCl, pH
7.5; and 0.1 mM EDTA. Hybridizations are carried out at 55°C for 30 to 60 minutes for 19-mer probes
DB27, DB28, DB29, DB35, DB34, DB37, DB38, and DB62; at 50°C for 17-mer probes DB30, DB31, DB33,
and DB59; and at 60°C for DB40 and DB41. The wash solution is 3 M TMACL; 50 mM Tris-HCl, pH 8;
and 2 mM EDTA. The wash is carried out first at 37°C for 20 minutes, then at the higher stringency
temperature (the hybridization temperature) for 10 minutes.
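
Because discrimination in TMACL depends on length alone, a probe panel can be sorted into hybridization batches simply by counting bases. The sketch below illustrates this with a few probes whose sequences are read from the probe table in Example 3 (DB27, DB28, DB30, and DB40/DB41 carry the sequences of DB10, DB11, DB13, and DB25/DB26, respectively). The 17-mer and 19-mer temperatures are those stated above; the 21-mer entry is an inference from the lengths of DB40 and DB41, which the text assigns to 60°C without naming their length. The function name is hypothetical.

```python
from collections import defaultdict

# Probe sequences as read from the Example 3 probe table (labels omitted).
PROBES = {
    "DB27": "GAATTACCTTTTCCAGGGA",
    "DB28": "ATTACGTGTACCAGTTACG",
    "DB30": "CCTGCTGCGGAGTACTG",
    "DB59": "CCTGAGGCGGAGTACTG",
    "DB40": "CTGCAGGGTCATGGGCCCCCG",
    "DB41": "CTGCAGGGTCACGGCCTCGTC",
}

# Hybridization temperature (degrees C) by probe length in 3 M TMACL.
# 17 and 19 are stated in the text; 21 is inferred from DB40/DB41.
TEMP_BY_LENGTH = {17: 50, 19: 55, 21: 60}


def group_by_temperature(probes: dict) -> dict:
    """Map each hybridization temperature to the sorted probe names run at it."""
    batches = defaultdict(list)
    for name, sequence in probes.items():
        batches[TEMP_BY_LENGTH[len(sequence)]].append(name)
    return {temp: sorted(names) for temp, names in batches.items()}


if __name__ == "__main__":
    for temp, names in sorted(group_by_temperature(PROBES).items()):
        print(f"{temp} C: {', '.join(names)}")
```

A probe whose length is not in the table raises a `KeyError`, which is deliberate here: the text gives no temperature for other lengths, so none is guessed.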


Example 4 

SSO Probes for Hybridization Analysis of DPA1 Alleles 

Examples of suitable SSO probes for hybridization analysis of DPA1 alleles are shown below. Two sets
of probes are illustrated; each set is designed to distinguish between the polymorphisms in a specific
segment of the second exon of the HLA-DPA1 gene. AS01 and AS02 bind to the DPA1 allele at the region
containing the polymorphic segments containing methionine (M) (amino acid 31) and glutamine (Q) (amino
acid 50), respectively. AS03 and AS04 bind to the DPA2 allele at the region containing the polymorphic
segments which contain glutamine (amino acid 31) and arginine (R) (amino acid 50), respectively. AS02
and AS04 distinguish the polymorphic segment at position 50 containing glutamine from that containing
arginine. The probes span the regions encoding these polymorphic amino acid residues. Hybridization
using these probes is usually carried out in a solution containing 5 x SSPE, 5 x Denhardt's, and 0.5% SDS,
for at least 1 hour at 42°C. Also shown are the washing conditions (temperature in degrees Celsius) for use
with the probes. The probe sequences are shown 5' to 3'.



HLA-DPA1 SSO Probes

Probe   Seq Id No.       Sequence             Wash Conditions
AS01    SEQ ID NO: 93    AGATGAGATGTTGTATG    2 x SSPE, 0.1% SDS, 42
AS02    SEQ ID NO: 94    GTTTGGCCAAGCCTTTT    2 x SSPE, 0.1% SDS, 50
AS03    SEQ ID NO: 95    AGATGAGCAGTTCTATG    2 x SSPE, 0.1% SDS, 50
AS04    SEQ ID NO: 96    GTTTGGCCGAGCCTTTT    2 x SSPE, 0.1% SDS, 55



Example 5 

Analysis of Amplified DPB1 Sequences by Hybridization with SSO Probes 



PCR-amplified DPB1 sequences from 24 HTCs (homozygous typing cells) were analyzed in a dot blot
format with a panel (n = 9) of 32P-labeled SSO probes, and the DPB1 type was inferred from the pattern of
probe binding. The extraction of DNA from the cells was as described in Example 1. The target regions of
the cellular genome, i.e., the second exon of the DPB1 genes, were amplified by the PCR technique using
primers DB01 and DB03 as described in Example 2, except that the DNA was from the cells listed in
tabular form above. At the time of this study, only a subset of the currently known alleles was known;
additional alleles have since been discovered using the methods of the present invention.

The amplified DNAs were dot blotted onto a filter; a separate filter containing the panel of samples was
prepared for analysis by hybridization with each SSO probe. To dot blot the samples, five microliters of
each amplified sample were denatured by diluting the sample with 195 microliters of a solution containing
0.4 N NaOH and 25 mM EDTA and spotted onto 9 replicate Genatran 45 (Plasco, Woburn, Massachusetts)
nylon filters by first wetting the filters with water, placing them in a Bio-dot (Bio-Rad, Richmond, CA)
apparatus for preparing dot blots, applying the samples, and rinsing each well with 0.4 ml of 20 x SSPE (3.6
M NaCl, 200 mM NaH2PO4, and 20 mM EDTA). The filters were removed, rinsed in 2 x SSPE, and baked
for 30 minutes at 80°C in a vacuum oven.

The samples on the filters were hybridized with SSO probes of the invention. Hybridization was with 
0.25 to 0.5 pmoles of probe in 2 to 5 ml of hybridization solution. Hybridization and wash conditions were as 
described in tabular form in Example 3. The results of this DPB1 typing are shown below, which also shows 
the probe with which the samples on the filter were hybridized and the encoded amino acid sequence 
detected by the probe. 

The DPB1 types of the cells, as determined by hybridization analysis with the SSO probes, have been
discussed above. To determine the DPB1 type of a sample, the binding of the probes to
the sample DNA was examined. The alleles present were inferred from the pattern of probe binding. For
example, sample 1 formed hybrids with SSO probes DB11, DB17, and DB20. The amino acids encoded by
DB11, DB17, and DB20 are VYQL, DED, and LEEK, respectively. An examination of segments A, C, and D
of the DPB1 allelic amino acid sequences shows that the sequence VYQL is present in DPB3, DPB6,
DPB11, and DPB13; the sequence DED is present in DPB17, DPB14, DPB12, DPB9, DPB6, and DPB3; the
sequence LEEK (L-K) is present in DPB14 and DPB3. The only allele (known at the time of the study) which
contains all three sequences that hybridize with the probes is DPB3. Thus, the DP type of sample 1
based on SSO typing is DPB3.
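
The inference for sample 1 amounts to intersecting the sets of alleles that carry each detected epitope. The following is a minimal sketch of that reasoning for a homozygous sample, using only the allele lists quoted in the paragraph above; the names `ALLELES_BY_EPITOPE` and `infer_alleles` are illustrative and do not come from the specification.

```python
# Alleles carrying each probe-detected epitope, as listed in the text
# for segments A (VYQL), C (DED), and D (LEEK).
ALLELES_BY_EPITOPE = {
    "VYQL": {"DPB3", "DPB6", "DPB11", "DPB13"},
    "DED": {"DPB3", "DPB6", "DPB9", "DPB12", "DPB14", "DPB17"},
    "LEEK": {"DPB3", "DPB14"},
}


def infer_alleles(positive_epitopes):
    """Return the alleles that carry every epitope detected in the sample.

    This applies to a homozygous sample (such as an HTC), where a single
    allele must account for all positive probes.
    """
    candidate_sets = [ALLELES_BY_EPITOPE[e] for e in positive_epitopes]
    return set.intersection(*candidate_sets)


# Sample 1 hybridized with DB11 (VYQL), DB17 (DED), and DB20 (LEEK).
sample_1 = infer_alleles(["VYQL", "DED", "LEEK"])
print(sorted(sample_1))
```

For heterozygous samples the same idea would have to be applied to pairs of alleles, which is where the ambiguities discussed later in this Example arise.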

The DPB1 types of the other samples were inferred by the same type of analysis, and the determined
types are depicted above. In the depiction, the asterisk (*) indicates cells where the DPB1 genotype was
also determined by sequence analysis. The DPB1 types of the cell lines COX (formerly w1 -> w3) and BM21
(formerly w1 -> blank) have recently been changed to the type indicated. The symbol +/- refers to a weak
signal obtained with a probe. In some cases (BM21 and TOK), this weak signal reflects the presence of an
additional polymorphic sequence in region A encoding the amino acid residues VHQL, which
cross-hybridizes to the DB11 probe. The sequence can be more conveniently typed with region A probe DB22.
For other cell lines (BM92), there is apparently a background cross-hybridization of the DB11 probe to the
sequences recognized by the DB10 probe. In similar fashion, cross-hybridization signals can occur with the
DB19 probe on sequences complementary to the DB18 probe.

The panel of SSOs used in this Example detects variation at only 3 of the 5 polymorphic regions and 
does not detect all allelic variants at these 3 regions. The typing system using the procedure of this 
example is simple and unequivocal for HTCs, but given the patchwork pattern of polymorphism, can give 
rise to ambiguous typing for heterozygous individuals if the hybridization pattern of the various probes can 
be interpreted as more than a unique pair of alleles. This ambiguity arises from the many different 
combinations of the DPB1 sequence variants that constitute the different DPB1 alleles. However, with the 
use of the additional SSO probes provided by the invention, which additional probes span the remaining 
polymorphic regions, and, possibly, allele-specific amplification as described above, unambiguous typings 
can be obtained for heterozygous individuals. 

Example 6 

HLA-DP Typing of Coeliac Disease Patients by DNA Sequence Analysis of PCR Amplified Target Regions



The cells of four patients with coeliac disease (CD) were PLT-typed, and the DNA sequences of the 
second exon of the DPB1 gene determined as described in Example 1. The CD diagnosis was based on 
clinical symptomology. 

The results of the analysis obtained by DNA sequencing of the DPB1 second exon of the CD cells have
been discussed above. From these results, one observes that, when CD cells are compared to non-CD
cells, there is an apparent increase in the frequency of the DPB4.2 allele. In addition, the DPB10 allele
sequence is present in two independent CD patients yet observed in only one of the 30 non-CD cell lines.

Example 7 

HLA-DP Typing of Coeliac Disease Patients by SSO Probe Hybridization Analysis 



The cells of 19 patients with CD and of 43 non-CD controls were analyzed for HLA-DP type by SSO 
probe hybridization analysis. The diagnosis of CD was based on clinical symptomology; the CD patients as 
well as control individuals were all from Italy. DNA extraction was as described in Example 1. PCR 
amplification of the samples was as described in Example 2. Analysis of the amplified sequences was as 
described in Example 4. 

The results of the analysis show that there was a significant increase in the frequency of the DPB4.2 allele
in CD patients as compared to non-CD controls; this allele was present in 12 of 19 CD patients, but in only
3 of 43 controls. The DPB4.2 and DPB3 alleles were present in 17 of 19 CD patients and in 15 of 43 controls. The
genotype DPB4.1/4.2 was present in 10 of 19 CD patients and in only 1 of 43 controls.
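
The specification does not state which statistical test supports the word "significant". As an illustration only, the sketch below applies a one-sided Fisher exact test (an assumption, not the authors' stated method) to the DPB4.2 counts given above, 12 of 19 CD patients versus 3 of 43 controls, and also computes the odds ratio. The function name `fisher_one_sided` is hypothetical.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact p-value for the 2 x 2 table [[a, b], [c, d]].

    Returns P(X >= a), where X is hypergeometric: the number of carriers
    falling in the first row when row and column totals are held fixed.
    """
    row1 = a + b            # size of the first group (CD patients)
    col1 = a + c            # total number of carriers
    n = a + b + c + d       # all individuals
    upper = min(row1, col1)
    favourable = sum(
        comb(col1, k) * comb(n - col1, row1 - k) for k in range(a, upper + 1)
    )
    return favourable / comb(n, row1)


# Counts from the text: DPB4.2 present in 12 of 19 CD patients, 3 of 43 controls.
cd_pos, cd_n = 12, 19
ctl_pos, ctl_n = 3, 43

odds_ratio = (cd_pos * (ctl_n - ctl_pos)) / ((cd_n - cd_pos) * ctl_pos)
p_value = fisher_one_sided(cd_pos, cd_n - cd_pos, ctl_pos, ctl_n - ctl_pos)

print(f"carrier frequency, CD: {cd_pos / cd_n:.2f}; controls: {ctl_pos / ctl_n:.2f}")
print(f"odds ratio: {odds_ratio:.1f}")
print(f"one-sided Fisher p-value: {p_value:.2e}")
```

The same function can be applied to the other counts in this Example (for instance the DPB4.1/4.2 genotype, 10 of 19 versus 1 of 43).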

Example 8 

HLA-DP Typing of Forensic Samples 

Samples which contain genomic nucleic acids of the suspect individual are obtained. The target regions
of the genome, i.e., the regions containing the second exon of the DPA1 and DPB1 genes, are amplified by
the PCR technique, as described in Example 2, except that the nucleic acids from the suspect replace
those of the cells in the Example, and except that the amplified samples contain a 32P-label. The amplified
sample is hybridized with the probes of the invention, which have been dot blotted onto the same filter, and
which have been immobilized on the filter by poly-dT tails. The techniques for preparing the immobilized
sequence-specific probes are described in International Patent Application, Publication No. WO 89/11548.
The hybridization and washing conditions for the filter allow only perfectly matched hybrids to remain in
duplex state. The filters are examined to determine which probes form hybrids with the labeled sample.

A sample which is to be compared with that obtained from the suspect is examined for DP type by the 
same procedure, i.e., by PCR amplification and hybridization with immobilized SSO probes. The pattern of
binding to the SSO probes of the sample from the suspect and of the comparison sample is examined to 
determine the similarity or difference in the hybridization patterns. 

Example 9 

HLA-DP Typing Using SSO Probes Labeled with Horseradish Peroxidase

A panel of cells from 39 individuals was typed for DPB1 alleles using SSO probes for the second exon 
variants. The probes were labeled with horseradish peroxidase (HRP), and hybrids detected in a dot blot 
format. 

The cells analyzed were from fourteen IDDM patients, five DR3 control non-IDDM patients, and nineteen
HTCs which typed by PLT as DP blank. IDDM patients were identified by clinical symptomology. DNA was
isolated from the cells as described in Example 1. The target region, i.e., the second exon of the DPB1
genes, was amplified using the PCR technique in 200 microliters of reaction mixture that contained 50 mM
Tris-HCl, pH 8.3; 2.5 mM MgCl2; 100 micrograms/ml gelatin; 0.75 mM of each of the four deoxynucleoside
triphosphates; the primers DB01 and DB03; and Taq polymerase. The amplification temperature cycling
profile was the following: heating for 30 seconds to 94°C followed by incubation at that temperature for 30
seconds; cooling for 1 minute to 55°C followed by incubation at that temperature for 30 seconds; heating to
72°C for 30 seconds followed by incubation at that temperature for 45 seconds. This cycling was repeated
for 42 cycles. After amplification, the reaction mixtures were sampled and monitored by gel electrophoresis
on gels containing 3% Nusieve, 1% Agarose to determine if all the DNA amounts were comparable.
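
As a quick check on the time budget implied by the cycling profile above, the ramp and hold times can be summed per cycle and multiplied by the 42 cycles. This is only an arithmetic sketch; the variable names are illustrative, and instrument overhead is not considered.

```python
# (step, ramp time in seconds, hold time in seconds), taken from the profile above.
PROFILE = [
    ("denature at 94 C", 30, 30),
    ("anneal at 55 C", 60, 30),
    ("extend at 72 C", 30, 45),
]
CYCLES = 42

# Each cycle is the sum of every ramp and every hold.
seconds_per_cycle = sum(ramp + hold for _, ramp, hold in PROFILE)
total_minutes = seconds_per_cycle * CYCLES / 60

print(f"{seconds_per_cycle} s per cycle, {total_minutes:.1f} min for {CYCLES} cycles")
```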

Filters containing the dot blotted samples were prepared by blotting 150 microliters/dot of denatured 
amplified DNA on Genatran nylon membranes and UV treating the filters containing the samples for 5 
minutes. The latter treatment is to fix the sample to the membranes. The amplified DNAs were denatured 
by treating 5 microliters of the PCR reaction mixture in a total volume of 150 microliters containing 0.4 N
NaOH and 25 mM EDTA. Eight replicate filters were prepared for hybridization with HRP labeled ASO 
probes. 

Prior to hybridization, the sample-containing filters were incubated for 15 minutes in pre-hybridization
solution (1 x SSPE, 5 x Denhardt's solution, 1% Triton X-100) without probe. In the pre-hybridization
solution, Triton X-100 was used in place of SDS. Hybridization was in the same solution, which additionally
contained 1 picomole probe/ml. Each filter was incubated for 40 minutes with 2.5 ml of hybridization
solution containing one of the HRP labeled probes. The probes used were DB27, DB28, DB29, DB30,
DB31, DB32, DB33, and DB35. The probes and hybridization conditions are listed in tabular form in
Example 3. After hybridization, the filters were washed as stated in Example 3, under suitably stringent
conditions, i.e., in 0.1 x SSPE, 0.1% Triton X-100 for 10 minutes at 42°C.

The HRP-labeled SSO probes were prepared essentially by the methods disclosed in U.S. Patent Nos.
4,962,029 and 4,914,210 and in corresponding International Patent Application, Publication Nos. WO
89/02932 and WO 89/02931. See also Levenson and Chang, "Nonisotopically Labelled Probes and
Primers," in PCR Protocols, pages 99-112, Academic Press, 1990, M. Innis, ed. These methods essentially
involve derivatizing the nucleic acid probe using a linear linking molecule comprising a hydrophilic polymer
chain (e.g., polyoxyethylene) having a phosphoramidite moiety at one end and a protected or unprotected
sulfhydryl moiety at the other end. The phosphoramidite moiety couples to the nucleic acid probe by
reactions well known in the art (e.g., Beaucage et al., 1988, Tetrahedron Lett. 22:1859-1862), while the
sulfhydryl group is free to form disulfide or other covalent bonds with the protein, e.g., HRP. In U.S. Patent
No. 4,962,029 and International Patent Application, Publication No. WO 89/02932, the HRP is conjugated to
the linking molecule through an N-maleimido-6-aminocaproyl group. The label is prepared by esterifying
N-maleimido-6-aminocaproic acid with sodium 4-hydroxy-3-nitrobenzene sulfonate in the presence of one
equivalent of dicyclohexylcarbodiimide in dimethylformamide. After purification, the product is added to
phosphate buffer containing HRP at a ratio of 1:8 HRP to ester. The oligonucleotide probe is synthesized in
a DNA synthesizer, and the linking molecule having the structure (C6H5)3CS-(CH2CH2O)4-P(CH2CH2CN)[N(i-PR)2]
is attached using phosphoramidite synthesis conditions. The trityl group is removed, and the HRP
derivative and probe derivative are mixed together and allowed to react to form the labeled probe. A
biotin-labeled probe or primer may be prepared by similar methods.

Samples which contained hybridized probe were detected using a color development reaction, as
described in Sheldon et al., 1986, Proc. Natl. Acad. Sci. USA 83:9085-9089, which utilizes TMB/H2O2. The
detection system is described in International Patent Application, Publication No. WO 89/11548. The
HLA-DP genotypes of the amplified DNA samples were readily apparent from the filters.

Example 10 

HLA-DPB1 Typing with HRP Labeled SSO Probes

A. PCR Amplification

DPB1 typing can utilize as many as or more than 14 SSO probes (sequence specific oligonucleotides), 
so amplification is carried out on 0.5 to two micrograms of DNA, in 200 microliters of reaction volume, if 
DNA is not limiting. Lower amounts of DNA, i.e., 100 ng, can be amplified, but more cycles of amplification 
should be performed with such samples, i.e., 45 cycles. 

The PCR reaction is started by mixing the following by vortexing for 1 to 2 seconds.

DNA                                   0.5 to 2 ug
10 X Taq buffer                       20 ul
100 mM dNTPs                          1.5 ul
DPB1 primer [10 uM UG19 or DB01]      10 ul
DPB1 primer [10 uM UG21 or DB03]      10 ul
Taq polymerase, 5 U/ul                1.2 ul

Glass-distilled H2O is added to achieve a final volume of 200 ul. 10 X Taq salts are 500 mM KCl; 100 mM
Tris, pH 8.3; 15 mM MgCl2; and 1 mg/ml gelatin. Negative controls (i.e., no DNA) should be included in
each PCR run. Typically, 30-35 cycles of amplification in a Perkin Elmer (Norwalk, CT) DNA Thermal Cycler
are sufficient. The cycles are designed to denature at 96°C for 30 seconds, and anneal and extend at 65°C
for 30 seconds. If primer pair DB01/DB03 is used, annealing is at 55°C for 30 seconds and extension is at
72°C for 30 seconds. Analytical gels can be used to check PCRs and to quantitate the amount of DNA to be
used for dot-blots.
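
The final concentrations in the 200 ul reaction follow from simple dilution (stock concentration times volume added, divided by final volume). The sketch below works these out for the components listed above. It assumes the primer stocks are 10 uM (micromolar); the unit is not fully legible in the listing, so that reading is an assumption. The helper name `final_conc` is illustrative.

```python
FINAL_VOLUME_UL = 200.0


def final_conc(stock_conc, volume_ul, total_ul=FINAL_VOLUME_UL):
    """Dilution rule C1 * V1 = C2 * V2, solved for C2 (same units as stock)."""
    return stock_conc * volume_ul / total_ul


# 1.5 ul of 100 mM dNTPs -> final concentration in mM.
dntp_mM = final_conc(100.0, 1.5)

# 10 ul of primer stock; 10 uM is an assumed reading of the listing.
primer_uM = final_conc(10.0, 10.0)

# 20 ul of 10 X buffer -> fold concentration in the reaction.
buffer_x = final_conc(10.0, 20.0)

# Composition of 10 X Taq salts as given in the text.
salts_10x = {
    "KCl_mM": 500.0,
    "Tris_mM": 100.0,
    "MgCl2_mM": 15.0,
    "gelatin_mg_per_ml": 1.0,
}
salts_final = {name: conc * buffer_x / 10.0 for name, conc in salts_10x.items()}

print(f"dNTPs: {dntp_mM} mM, primer: {primer_uM} uM, buffer: {buffer_x} X")
print(salts_final)
```

The 0.75 mM dNTP result agrees with the dNTP concentration stated for the reaction mixture in Example 9.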

B. Dot blot 

Typically, 5 ul of the amplified DNA contains approximately 200 ng, more than enough for a single dot 
blot. Remember, however, that as many as or more than 14 dots may be required, i.e., about 70 ul of the 
amplification reaction would then be used in preparing the dot blots. For each 5 ul of amplification reaction. 
50 ul of 0.4 N NaOH and 25 mM EDTA are added to the DNA. Five minutes is sufficient to complete 
denaturation of DNA. The Genatran membrane is first wetted in 2 x SSPE, and then the 55 ul of denatured 
DNA are loaded into the dot blot apparatus. The membrane is rinsed in 2 x SSPE, and the DNA is fixed to 
the membrane with exposure to UV light for five minutes, i.e., by a 55 mJ/cm2 exposure in a Stratalinker
1800™ UV light box, marketed by Stratagene.
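
The volumes in this section scale linearly with the number of replicate dots. The following sketch tabulates them for a given number of probes, using the per-dot figures from the text (5 ul of amplified DNA containing about 200 ng, plus 50 ul of denaturant); the function name `plan_dot_blots` is hypothetical.

```python
AMPLIFIED_UL_PER_DOT = 5      # amplified DNA per dot
DENATURANT_UL_PER_DOT = 50    # 0.4 N NaOH, 25 mM EDTA per dot
NG_PER_DOT = 200              # approximate DNA content of 5 ul


def plan_dot_blots(n_probes, reaction_ul=200):
    """Return the volumes needed to spot one dot per probe from one reaction."""
    amplified_used = n_probes * AMPLIFIED_UL_PER_DOT
    return {
        "amplified_ul_used": amplified_used,
        "amplified_ul_left": reaction_ul - amplified_used,
        "denaturant_ul": n_probes * DENATURANT_UL_PER_DOT,
        "loaded_ul_per_dot": AMPLIFIED_UL_PER_DOT + DENATURANT_UL_PER_DOT,
        "dna_ng_total": n_probes * NG_PER_DOT,
    }


plan_14 = plan_dot_blots(14)
print(plan_14)
```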

C. Hybridization

The membrane is again wetted in 2 x SSPE, and about 5 ml of hybridization solution per 8 x 12 cm
membrane (size of dot blot apparatus) are added. About 1 to 1.5 picomoles of HRP probe are added per ml
hybridization solution, and the probes are allowed to hybridize for at least one hour. Hybridization solution is
SSPE (as indicated above), 5 x Denhardt's, and 1% Triton X-100. The membranes are then washed with
0.1 X SSPE and 0.2% Triton X-100 for ten minutes. Otherwise, hybridization and wash conditions are as
described in Example 3. The probes used are DB27, DB29, DB30, DB31, DB33, DB34, DB35, DB37, DB38,
DB40, DB41, DB58, DB59, DB62, and DB63.

D. Detection

The following steps for detection of probes are done at room temperature with moderate shaking and
just enough solution to cover the membrane completely, as is described in Bugawan et al., 1988,
Bio/Technology 6:943-947, incorporated herein by reference. The detection is carried out by a 5 minute
incubation of the membrane with Buffer B, a 5 minute wash with Buffer C, and 10 minutes incubation under
light exclusion with Buffer C and TMB (48 ml of Buffer C and 2.5 ml of 2 mg/ml TMB). Buffer B is 100 mM
NaCl, 1 M urea, 5% Triton X-100, and 1% dextran sulfate. Buffer C is 100 mM sodium citrate, pH 5.0. TMB
is 3,3',5,5'-tetramethylbenzidine. About 23 ul of 3% H2O2 are added to 50.5 ml of Buffer C/TMB; the
resulting solution is used to develop the color on the membranes (color comes up within 1 to 15 minutes)
under light-excluding conditions. The color development is stopped by washing the filters in H2O with a
small amount of Buffer C. The Buffer C wash is repeated twice for 30 minutes. Pictures of the membrane
are taken and the membrane is stored in Buffer C under light exclusion.
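
The working concentrations of TMB and H2O2 in the color development solution can be checked by dilution arithmetic. This is an illustrative calculation from the volumes stated above, assuming the volumes are simply additive; the variable names are not from the specification.

```python
# Buffer C / TMB mixture: 48 ml of Buffer C plus 2.5 ml of 2 mg/ml TMB.
buffer_c_ml = 48.0
tmb_stock_mg_per_ml = 2.0
tmb_stock_ml = 2.5
mix_ml = buffer_c_ml + tmb_stock_ml  # 50.5 ml, as stated in the text

tmb_final_mg_per_ml = tmb_stock_mg_per_ml * tmb_stock_ml / mix_ml

# About 23 ul of 3% H2O2 added to the 50.5 ml mixture.
h2o2_stock_percent = 3.0
h2o2_ml = 0.023
h2o2_final_percent = h2o2_stock_percent * h2o2_ml / (mix_ml + h2o2_ml)

print(f"TMB:  {tmb_final_mg_per_ml:.3f} mg/ml")
print(f"H2O2: {h2o2_final_percent:.4f} %")
```

Both values come out close to the 0.1 mg/ml TMB and 0.0015% hydrogen peroxide quoted for the detection step in Example 11.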

The methods described herein, as well as the SSO probes and primers, and kits containing them, are 
useful for the accurate, relatively simple, and economic determination of an individual's HLA-DP genotype. 
Accurate DP typing will prove important in several medical applications. For example, accurate HLA-DP 
matching of donors and recipients may be helpful in the prevention of allograft rejection and in the 
prevention of graft versus host disease. Because certain HLA-DP genotypes appear to be linked to certain 
autoimmune diseases, including, for example, coeliac disease, pauciarticular JRA, and IDDM, HLA-DP DNA
typing may be useful in early diagnosis of the disease, prior to manifestation of full clinical symptoms. 

Accurate HLA-DP typing is useful in forensic medicine. For example, it provides evidence as to whether 
a sample which contains genomic nucleic acids, for example, blood, hair, or semen is derived from a 
suspect individual. It is also useful in determining an individual's paternity or maternity. The latter is of 
particular importance in analyzing historical samples. 

Example 11

15-Probe DPB1 Typing Assay: Dot Blot and Reverse Dot Blot Formats

Both dot blot and reverse dot blot hybridization protocols have been described above. Described here 
are DPB1 typing assays using each of the hybridization protocols. These assays use 15 probes: 4 for
region A, 4 for region C, 5 for region D, and 2 for region F. Both protocols have been used successfully for
typing at the DPB1 locus as described in Bugawan et al., 1990, Immunogenetics 32:231-241, incorporated
herein by reference. 

A. Dot Blot 

Amplification is performed using primers UG19 and UG21 using the amplification protocol described for 
these primers in Example 2. 

In the dot blot format, the amplified DNA is immobilized on a membrane. Approximately 100 ng of
amplified DNA is denatured in a solution of 45 ul of 0.4 N NaOH and 25 mM EDTA for 10 minutes at room
temperature. The membrane (Genetrans-45 [Plasco, Woburn, Massachusetts] or Biodyne [Pall, Glen Cove,
New York]) is pre-wet in 2 x saline-sodium phosphate-EDTA (SSPE) or 10 mM Tris, 0.1 mM EDTA. Then,
50 ul of the denatured DNA sample is applied to the membrane using a dot blot apparatus (Biodot, BioRad,
Richmond, California). The DNA is ultraviolet (UV) crosslinked to the membrane using a Stratalinker
(Stratagene, La Jolla, California) at 50 mJ/cm2. Unbound DNA is removed by briefly rinsing the membrane
in the same solution used to pre-wet the membrane.

Hybridization is carried out essentially as described in the Examples above. Probes and hybridization 
conditions are shown below. Two sets of probes and the hybridization and wash conditions are shown. The 
first set uses a hybridization solution of SSPE (at the indicated concentration) and 0.5% sodium dodecyl 
sulfate (SDS). Hybridization is carried out in a shaking water bath (except where noted) at the indicated 
temperature (celsius) for 30-60 minutes with 1 pmol HRP labeled probe per ml hybridization solution (8 ml 
hybridization solution are used per 96-sample filter). Unbound probe is then removed by washing the filters 
for 10 minutes in 0.1 x SSPE plus 0.1% SDS at the indicated temperature. 

The second set of probes and hybridization conditions are for hybridization using TAMCI (described in
Example 3). Hybridization is carried out at the indicated temperature in 3 M TAMCI, 0.5% SDS, 10 mM Tris
(pH 7.2), and 0.1 mM EDTA as described above. Unbound probe is removed by washing the filters in 3 M
TAMCI, 50 mM Tris (pH 7.2), and 2 mM EDTA for 10 minutes at 37°C and then for 10 minutes at the
indicated stringent temperature.

In the table below, probes in parentheses are probes designed to work under the hybridization
conditions for TAMCI. Where no second probe is shown, the indicated probe is used for both SSPE and
TAMCI hybridization conditions. The sequences of all DPB1 probes are provided in the Examples above
and in the Sequence Listing section. Also included is one probe (DB123, SEQ ID NO: 136,
5'CGCTTCGACAGCGACGT) which is specific for a nonvariable sequence just 3' of region B and hybridizes
to all DPB1 allelic sequences. This "All" probe is used to control for the amount of amplified DPB1 DNA in
the hybridization reaction.

Probes for Dot Blot Hybridization

Probe    Hybridization    Hybridization/Wash Temperature

- |Sa,B5S). II ^f. Hi 

ii(DBU7, II S tf!g 






R^ITc"'^ 3 x 50/42 55/58 

DB14mRl9l^ 3x 50/42 5S/58 

DBM (DB121) 5 x 42/42 55/58 

^7a)Bl22)* 5 x 50/42 55/58 

Dilimi??! t'' 55/42 55/58 

DB19(DB73) 5x ^42 55/60 



DBG2 3x 

DB63 3x 
Region F 

SS? 3x 50/42 

^1 3x 50/42 

DB123 3x 



55/58 

50/42 52/58 
50/42 55/58 



55/58 
55/58 



50/42 55/58 
* An air incubator is used for these probes instead of a shaking water bath. 

Hybridization of the HRP-labeled probe to the immobilized DNA is detected by using the colorless
soluble substrate, tetramethyl benzidine (TMB, Fluka, Ronkonkoma, New York), which is converted to a
blue precipitate by HRP in the presence of hydrogen peroxide. The detection is carried out at room
temperature with moderate shaking as follows. After washing, the membranes are incubated for 30 minutes
in 1 X Dulbecco's phosphate-buffered saline and then transferred to buffer C (100 mM sodium citrate, pH 5)
plus 0.1 mg/ml TMB. TMB is precipitated by the immediate addition of hydrogen peroxide to a final
concentration of 0.0015% and appears as a blue precipitate in 1 to 5 minutes. The reaction is stopped by
transferring the membranes to 0.01 x buffer C. For a permanent record, the membranes can be
photographed. The precipitates are stable for over 2 months if stored individually in the dark with buffer C.
Alternatively, hybridization of the HRP-labeled probes can be detected using the highly sensitive,
commercially available ECL Gene Detection System Kit (Amersham, Arlington Heights, Illinois).



B. Reverse Dot Blot



In the reverse dot blot format, the probes are immobilized on a membrane and hybridized to the
biotinylated amplified DNA. The protocols used are essentially as described in Saiki et al., 1989, Proc. Natl.
Acad. Sci. USA 86:6215-6219, incorporated herein by reference.

Amplification is performed as for the dot blot described above with the exception that each primer
contains a single biotin molecule attached at the 5' end of the oligonucleotide. The 15 probes, shown below,
are given poly-T tails using either terminal deoxyribonucleotidyl transferase or chemical synthesis and bound
to the filters. Due to their length, the tails are preferentially bound to the membrane, leaving the
oligonucleotide probe free to hybridize. Unbound probe is removed by washing the membranes for at least
30 minutes in 1 x SSPE, 0.5% SDS at 50°C.

The 15 probes used are listed below grouped according to the region of the gene to which they 
hybridize. The specific amino acid epitopes detected are the same epitopes detected by the 15 probes 
used in the dot blot format described above. The sequence of the hybridization region of each probe is 
provided in the Examples above and in the Sequence Listing section. The "All" probe described above also 
is included here. 





Reverse Dot Blot Probes

Region A    DB136, DB36, DB12, DB118
Region C    DB13, DB101, DB59, DB17
Region D    DB72, DB73, DB74, DB155, DB154
Region F    DB94, DB95
All         DB123



For hybridization, the biotinylated amplified DNA is denatured at 95°C for 5 minutes and then cooled
on ice. Fifty ul of denatured DNA is added to the filters in 2.5 ml pre-warmed (50°C) hybridization solution
(1 X SSPE, 0.5% SDS) along with 70 ul of 20 mg/ml streptavidin-HRP conjugate (Amplitype™, Hoffmann-La
Roche Inc., Nutley, New Jersey). Hybridization is carried out for 30 minutes at 50°C in a shaking water
bath. Unbound probe is removed by washing the filters for 10 minutes in 0.25 x SSPE, 0.1% SDS at 42°C
in a shaking water bath. Hybridization is detected as in the dot blot assay.



Example 12 

25 Probe DPB1 Typing Assay 

A dot blot DPB1 typing assay essentially as described in Example 11 but modified to include five 
additional probes for the five different epitopes at amino acids 33-36 (Region B), three additional probes for 
the three variable amino acids at position 76 (region E), and two additional probes to differentiate the GGPM 
and VGPM epitopes at amino acids 84-87 (Region F) has also been designed. The set of probes, shown 
below, differs from those described in Example 11; however, the variable regions probed and the
sequences detected are the same with the addition of the five probes for Region B, the three probes for
Region E, and the two probes for Region F noted above. As in Example 11, the "All" probe, DB123, is
included with the set of probes as a control. The sequences are provided in the Examples, above, and in
the Sequence Listing section. 

Hybridization is carried out essentially as described in Example 11, above. Probe hybridization and
wash conditions are shown in the table below. A hybridization solution of SSPE at the indicated concentration
and 0.5% sodium dodecyl sulfate (SDS) is used. Hybridization is carried out in a shaking water bath,
except where noted, at the indicated temperature (celsius) for 20-60 minutes with 2 pmol HRP labeled 
probe per ml hybridization solution (8 ml hybridization solution are used per 96-sample filter). Unbound 
probe is then removed by washing the filters in 0.1 x SSPE plus 0.1% SDS using the below indicated 
conditions. 



Region  Probe   Epitope  Hybridization           Wash
                         Temp.     Solution      Temp.        Solution   Time

A       DB27    LFQG     50°C      5 X           40°C (air)   0.1 X      10 min.
A       DB29    VYQG     50°C      5 X           42°C         0.1 X      10 min.
A       DB35    VHQL     50°C      2 X           42°C         0.1 X      15 min.
A       DB36    VYQL     42°C      2 X           42°C         0.1 X      15 min.
B       AB117   EEFARF   55°C      5 X           50°C         0.1 X      15 min.
B       AB119   EELVRF   50°C      5 X           50°C         0.1 X      12 min.
B       AB120   QEYARF   50°C      3 X           50°C         0.1 X      15 min.
B       AB121   EEYARF   55°C      1 X           50°C         0.1 X      12 min.
B       AB124   EEFVRF   55°C      1 X           42°C         0.1 X      15 min.
C       DB30    AAE      50°C      3 X           42°C         0.1 X      15 min.
C       DB33    DED      50°C      5 X           42°C         0.1 X      10 min.
C       DB59    EAE      50°C      5 X           42°C         0.1 X      10 min.
C       DB101   DEE      50°C      1 X           42°C         0.1 X      10 min.
D       DB34    IK       55°C      2 X           42°C         0.1 X      15 min.
D       DB37    IE       55°C      1 X           42°C         0.1 X      15 min.
D       DB38    LK       55°C      1 X           42°C         0.1 X      15 min.
D       DB62    LE       50°C      3 X           42°C         0.1 X      15 min.
D       DB63    LR       55°C      2 X                        0.1 X      10 min.
E       AB96    M        42°C      1 X           50°C         0.2 X      15 min.
E       AB97    V        42°C      2 X           50°C         0.2 X      15 min.
E       AB98    I        42°C      2 X           50°C         0.4 X      15 min.
F       DB77    DEAV     50°C      3 X           42°C         0.1 X      10 min.
F       AB122   GGPM     55°C      3 X           55°C         0.1 X      15 min.
F       AB123   VGPM     55°C      3 X           55°C         0.1 X      15 min.
"ALL"   DB123   "ALL"    50°C      3 X           42°C         0.1 X      10 min.



The 25-probe assay was used to characterize the allelic polymorphism in several different populations 
at a time when only 20 alleles were known to exist. A number of samples displayed unique hybridization 
patterns suggesting the existence of 10 alleles not previously observed. These putative novel alleles were
cloned into M13 vectors and sequenced essentially as described in the Examples, above, using amplification
primers UG21 and AB111. The primer AB111 is identical to the primer UG19, with the addition of a
BamHI site at the 5' end to facilitate cloning. The nucleotide sequences of UG19 and UG21 are provided in 
Example 2, above, and in the Sequence Listing section. The sequence of AB111 is shown below and in the 
Sequence Listing Section. 

Transformants were screened for the presence of the DPB1 second exon using a PCR-based assay.
Phage DNA was transferred by pipette tip into a 100 ul PCR reaction mixture containing 50 mM KCl, 10
mM Tris pH 8.3, 200 uM each dNTP, 4 mM MgCl2, 2.5 Units of Taq polymerase (Perkin Elmer, Norwalk,
CT), and 20 pmoles of each of the primers RS348 and RS349 (sequences shown below) which flank the
M13 cloning site. After 35 cycles of amplification (denaturation at 95°C for 1 minute, annealing at 60°C for
30 seconds, extension at 72°C for 30 seconds), 5 ul of PCR product was run on a 3% Nusieve/1%
Agarose gel, and clones which produced a PCR product of the correct size (approximately 400 base pairs)
were sequenced using the dideoxy chain termination method with 35S-dATP and Sequenase 2.0 (United
States Biochemicals).



Primer   Seq Id. No.       Sequence

AB111    SEQ ID NO: 91     5'GGGATCCGAGAGTGGCGCCTCCGCTCAT
RS348    SEQ ID NO: 142    5'ACACAGGAAACAGCTATGACCATG
RS349    SEQ ID NO: 143    5'CCAGGGTTTTCCCAGTCACGAC
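
The text states that AB111 carries a BamHI site at its 5' end. As a small illustration, the sketch below locates the BamHI recognition sequence (GGATCC) in AB111 and confirms that the site is its own reverse complement. The AB111 sequence used here is read from the listing above with the letter O taken as C, which is an assumption about the printed sequence; the helper name `reverse_complement` is illustrative.

```python
# AB111 as read from the primer table (letter O in the listing taken as C).
AB111 = "GGGATCCGAGAGTGGCGCCTCCGCTCAT"
BAMHI_SITE = "GGATCC"

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


# Zero-based offset of the restriction site from the 5' end of the primer.
site_offset = AB111.find(BAMHI_SITE)

# Restriction sites of this kind are palindromic on the double strand.
site_is_palindromic = reverse_complement(BAMHI_SITE) == BAMHI_SITE

print(f"BamHI site starts at offset {site_offset} from the 5' end")
print(f"Site is palindromic: {site_is_palindromic}")
```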



The 10 new alleles, the populations in which each of the 10 new alleles was found, and the allele
frequency within that population are shown in the Table below. For each allele, two designations are given.
The first is the official name assigned by the WHO Nomenclature Committee; the second name, shown
below in parentheses, is an equivalent name. The nucleotide sequences of these 10 new alleles are shown
in allele sequence tables, above, and in the Sequence Listing section. The sequences have been submitted
to the Genbank nucleotide sequence database and have been assigned the accession numbers
M84617-M84626.



Newly Discovered Alleles

Allele               Population          Frequency   Decimal Value

DPB1*2801 (DPB21)    SE Asians           11/216
                     Indonesians                     .007
DPB1*3101 (DPB22)    SE Asians           1/216       .005
                     Indonesians         12/276      .043
                     Gambians                        .002
DPB1*2701 (DPB23)    Hispanics           2/200       .01
                     African Americans   4/292       .014
                     New Guineans                    .004
                     Gambians            27/1284     .021
DPB1*3201 (DPB24)    New Guineans                    .004
DPB1*3301 (DPB25)    Hispanics           1/200       .005
DPB1*3401 (DPB26)    Hispanics           1/200       .005
                     Mexican Americans   1/200       .005
DPB1*2901 (DPB27)    African Americans   1/200       .005
                     New Guineans        3/268       .011
                     Indonesians         1/276       .004
DPB1*3001 (DPB28)    African Americans   1/162       .006
                     Gambians            5/1284      .004
                     Sudanese            8/?         ?
DPB1*3501 (DPB29)    Gambians            6/1284      .005
                     Portuguese          1/?         ?
DPB1*2101 (DPB30)    SE Asians           10/216      .046
                     Aus Aborigines      1/34        .029

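
The decimal values in the table are the observed counts divided by the number of chromosomes sampled, rounded to three places. The sketch below recomputes a few of the rows whose counts and decimal values are both legible in the table; the tuple layout and the name `ROWS` are illustrative only.

```python
# (allele, population, observed count, chromosomes sampled, decimal value in table)
ROWS = [
    ("DPB1*3101", "Indonesians", 12, 276, 0.043),
    ("DPB1*2701", "African Americans", 4, 292, 0.014),
    ("DPB1*2701", "Gambians", 27, 1284, 0.021),
    ("DPB1*2901", "New Guineans", 3, 268, 0.011),
    ("DPB1*3001", "African Americans", 1, 162, 0.006),
    ("DPB1*2101", "SE Asians", 10, 216, 0.046),
    ("DPB1*2101", "Aus Aborigines", 1, 34, 0.029),
]

# Recompute each frequency and compare it with the tabulated decimal value.
recomputed = [round(count / total, 3) for _, _, count, total, _ in ROWS]
all_match = all(value == row[4] for value, row in zip(recomputed, ROWS))

for (allele, population, count, total, _), value in zip(ROWS, recomputed):
    print(f"{allele:10s} {population:18s} {count}/{total} = {value}")
print("all rows agree with the table:", all_match)
```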



Most of the alleles appear to be relatively infrequent (<5%) in the populations examined. In all but two
instances, DPB1*3201 and DPB1*3301, each allele was found in a minimum of two unrelated individuals. In
general, these 10 new alleles display the same patchwork pattern of polymorphism identified in the 20
DPB1 alleles described in the previous examples. DPB1*3201 is unique in that it has a single nucleotide
substitution (A -> T) in the codon specifying amino acid 57 resulting in a valine residue at this position and
a new sequence motif (DEV) in the third region (region C) of variability. The probe AB112 (SEQ ID NO: 92)
was designed to detect this new sequence motif. The sequence of AB112 (SEQ ID NO: 92) is shown below:



AB112    SEQ ID NO: 92    5' XCCTGATGAGGTGTACTG



where X indicates HRP. This probe has been used to type close to 2,000 individuals in a variety of
populations, and so far it has only hybridized with the original sample in which the DEV motif was found,
suggesting it is a rare variant.

Two of the alleles, DPB1*3101 and DPB1*3401, have a single base alteration (G -> T) at nucleotide
position 214 resulting in a single amino acid change (V -> L) at position 72. This is the first example of a
polymorphic position outside of the six regions herein designated A through F.
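
The relationship between nucleotide position 214 and amino acid position 72 can be checked with integer arithmetic. The sketch assumes 1-based numbering in which nucleotide 1 is the first base of codon 1; the specification's numbering scheme is not restated here, so this is an assumption, and `codon_of` is an illustrative name.

```python
def codon_of(nucleotide_position):
    """Map a 1-based nucleotide position to (codon number, base within codon).

    Assumes nucleotide 1 is the first base of codon 1.
    """
    zero_based = nucleotide_position - 1
    return zero_based // 3 + 1, zero_based % 3 + 1


codon_number, base_in_codon = codon_of(214)
print(f"nucleotide 214 -> codon {codon_number}, base {base_in_codon} of the codon")
```

Under that numbering the substitution falls on the first base of codon 72, which is consistent with a valine codon (GTN) changing to a TTN codon.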

This DPB1 typing assay consists of 25 sequence-specific oligonucleotide probes: 4 for Region A
(LFQG, VYQL, VYQG, VHQL), 5 for Region B (EEFARF, EELVRF, QEYARF, EEYARF, and EEFVRF), 4 for
Region C (AAE, DEE, EAE, DED), 5 for Region D (I-K, I-E, L-K, L-E, L-R), 3 for Region E (M, V, I), and 2 for
Region F (GGPM, VGPM, DEAV). With the original 20 DPB1 alleles that were known before the discovery of
the 10 new alleles discussed above, this assay was able to distinguish all but 6 of the 190 heterozygous
genotypes (with 20 alleles there are 210 possible genotypes, 190 of which are heterozygous). The addition
of the 10 new alleles reported here increases the number of DPB1 alleles to 30, and the number of possible
genotypes to 465. The addition of these new alleles introduces additional ambiguities into the assay such
that certain heterozygous combinations may have indistinguishable probe hybridization patterns. However,
with the use of the 25-probe typing assay plus the probe for the DEV motif (AB112, SEQ ID NO: 92), all
but 9 pairs of genotypes (18 genotypes) are distinguishable.
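The genotype counts quoted above follow from simple combinatorics: with n alleles there are n homozygous genotypes and n(n-1)/2 heterozygous ones. The sketch below reproduces the figures for 20 and 30 alleles; the function name is hypothetical.

```python
from math import comb


def genotype_counts(n_alleles: int) -> tuple[int, int, int]:
    """Return (total, heterozygous, homozygous) unordered diploid genotypes."""
    homozygous = n_alleles              # one genotype per allele paired with itself
    heterozygous = comb(n_alleles, 2)   # unordered pairs of distinct alleles
    return homozygous + heterozygous, heterozygous, homozygous


if __name__ == "__main__":
    for n in (20, 30):
        total, het, hom = genotype_counts(n)
        print(f"{n} alleles: {total} genotypes, {het} heterozygous, {hom} homozygous")
```

For 20 alleles this gives 210 genotypes of which 190 are heterozygous, and for 30 alleles it gives 465 genotypes, matching the numbers in the text.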

With the study of more populations, it is expected that new alleles will be discovered. The methods of 
the present invention provide means to detect new alleles and are not limited to the alleles described 
above. Rapid and accurate DPB1 typing is accomplished by the selection of suitable probe panels by the 
methods described herein. 






EP 0 575 845 A2 



SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: F. HOFFMANN-LA ROCHE AG

(B) STREET: Grenzacherstrasse 124 

(C) CITY: Basel 

(D) STATE: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : CH-4002 

(G) TELEPHONE: (0)61 - 688 24 03 
(H) TELEFAX: (0)61 - 688 13 95
(I) TELEX: 962292/965542 hlr ch 

(ii) TITLE OF INVENTION: Method for HLA-DP typing 

(iii) NUMBER OF SEQUENCES: 157 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/903,028 

(B) FILING DATE: 23-JUN-1992 



(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Asp His Val Ser Thr Tyr Ala Ala Phe Val Gln Thr His Arg Pro Thr
1 5 10 15

Gly Glu Phe Met Phe Glu Phe Asp Glu Asp Glu Met Phe Tyr Val Asp
20 25 30

Leu Asp Lys Lys Glu Thr Val Trp His Leu Glu Glu Phe Gly Gln Ala
35 40 45

Phe Ser Phe Glu Ala Gln Gly Gly Leu Ala Asn Ile Ala Ile Leu Asn
50 55 60

Asn Asn Leu Asn Thr Leu Ile Gln Arg Ser Asn His Thr Gln Ala Thr
65 70 75 80

Asn
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Asp His Val Ser Thr Tyr Ala Ala Phe Val Gln Thr His Arg Pro Thr
1 5 10 15

Gly Glu Phe Met Phe Glu Phe Asp Glu Asp Glu Gln Phe Tyr Val Asp
20 25 30

Leu Asp Lys Lys Glu Thr Val Trp His Leu Glu Glu Phe Gly Arg Ala
35 40 45

Phe Ser Phe Glu Ala Gln Gly Gly Leu Ala Asn Ile Ala Ile Leu Asn
50 55 60

Asn Asn Leu Asn Thr Leu Ile Gln Arg Ser Asn His Thr Gln Ala Ala
65 70 75 80

Asn



INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp His Val Ser Thr Tyr Ala Glu Phe Val Gln Thr His Arg Pro Ser
1 5 10 15

Gly Glu Tyr Met Phe Glu Phe Asp Glu Glu Glu Gln Phe Tyr Val Asn
20 25 30

Leu Asp Glu Lys Glu Met Val Trp Pro Leu Pro Glu Phe Ile His Thr
35 40 45

Phe Asp Phe Gly Ala Gln Arg Gly Ile Ala Gly Ile Val Met Ala Arg
50 55 60

Lys His Leu Asn Thr Arg Ile Asn Gly Lys Gln Thr Trp Ala Thr Asp
65 70 75 80
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asn Ser Val Tyr Gln Glu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Val Val Asp Gly Leu Ile Tyr Asn Arg Glu Glu Tyr Val His
20 25 30

Phe Asp Ala Asp Val Gly Glu Leu Arg Ala Met Thr Glu Leu Gly Arg
35 40 45

Pro Ile Gly Glu Tyr Phe Asn Ser Gln Lys Asp Phe Met Glu Arg Lys
50 55 60

Arg Ala Glu Val Asp Lys Val Cys Arg His Lys Tyr Glu Leu Met Glu
65 70 75 80

Pro Leu Ile Arg Gln Arg Arg
85

(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GTGTACCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGGAGGA GTACGCGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGCTGCGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGAAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240 

ACCCTGCAG 249 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Val Tyr Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Tyr Ala Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Ala
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 257 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG ATGAGGAGTA CTGGAACAGC CAGAAGGACA 180

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGGCG 240

GGCCCATGAC CCTGCAG 257



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Asp Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Gly Gly
65 70 75 80

Pro Met Thr Leu Gln Arg Arg
85

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GCTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGAGGCGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180 

GAGGAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGG CGGGCCCATG 240

ACCCTGCAG 249 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 85 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys
50 55 60

Arg Ala Leu Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GTGTACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAC TACTGGAACA GCCAGAAGGA CCTCCTGGAG 180

GAGAAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGCGCGCTT CGACAGCGAC GTGGGGGAGT 120

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACA 180

TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGGCG 240

GGCCCATGAC CCTGCAGCGC CGAG 264

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Gly Gly
65 70 75 80

Pro Met Thr Leu Gln Arg Arg
85

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGATGAGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGAAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGG CGGGCCCATG 240 

ACCCTGCAGC GCCGAG 256 

(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Gly Gly Pro Met
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGGAGGA GCTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGAGGCGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180 

GAGAAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240 
ACCCTGCAG 249



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Leu Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Glu
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GTGTACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGATGAGGAC TACTGGAACA GCCAGAAGGA CCTCCTGGAG 180 

GAGGAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240 

ACCCTGCAG 249 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGGAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GTGCACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAC TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGGAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

GTGCACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGATGAGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180 

GAGGAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240 

ACCCTGCAG 249 
(2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GTGTACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 

TACATCTACA ACCGGCAGGA GTACGCGCGC TTCGACAGCG ACGTGGGAGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGCTGCGGAG TACTGGAACA GCCAGAAGGA CCTCCTGGAG 180 

GAGAGGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249



(2) INFORMATION FOR SEQ ID NO:28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Gln Glu Tyr Ala Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Ala
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Arg Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 29:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GTGTACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTACGCGCGC TTCGACAGCG ACGTGGGAGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGCTGCGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180 

GAGGAGCGGG CAGTGCCGGA CAGGATATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240 

ACCCTGCAG 249 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 amino acids

(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Tyr Ala Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Ala
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Ile Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GTGCACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAC TACTGGAACA GCCAGAAGGA CCTCCTGGAG 180

GAGAAGCGGG CAGTGCCGGA CAGGGTATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 33:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

GTGTACCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGCAGGA GTACGCGCGC TTCGACAGCG ACGTGGGAGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGCTGCGGAG TACTGGAACA GCCAGAAGGA CCTCCTGGAG 180

GAGAGGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGT CGGGCCCATG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Val Tyr Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Gln Glu Tyr Ala Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Ala
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Arg Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Val Gly Pro Met
65 70 75 80

Thr Leu Gln



(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGGAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249



(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln









(2) INFORMATION FOR SEQ ID NO: 37:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

GTGCACCAGT TACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGATGAGGAC TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGGAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 38:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 83 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GTGTACCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60 


TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120 

GTGACGGAGC TGGGGCGGCC TGATGAGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180 

GAGAAGCGGG CAGTGCCGGA CAGGATGTGC AGACACAACT ACGAGCTGGT CGGGCCCATG 240 


ACCCTGCAG 249 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Val Tyr Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Val Gly Pro Met
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTTTTCCAGG GACGGCAGGA ATGCTACGCG TTTAATGGGA CACAGCGCTT CCTGGAGAGA 60

TACATCTACA ACCGGGAGGA GTTCGTGCGC TTCGACAGCG ACGTGGGGGA GTTCCGGGCG 120

GTGACGGAGC TGGGGCGGCC TGAGGCGGAG TACTGGAACA GCCAGAAGGA CATCCTGGAG 180

GAGGAGCGGG CAGTGCCGGA CAGGATATGC AGACACAACT ACGAGCTGGA CGAGGCCGTG 240

ACCCTGCAG 249

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Glu
35 40 45

Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu Arg Ala
50 55 60

Val Pro Asp Arg Ile Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr Gln Arg
1 5 10 15

Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg Phe Asp
20 25 30

Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg Pro Asp
35 40 45

Glu Asp Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys Arg Ala
50 55 60

Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu Ala Val
65 70 75 80

Thr Leu Gln

(2) INFORMATION FOR SEQ ID NO: 44:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGCGCGCTT CGACAGCGAC GTGGGGGAGT 120

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG ATGAGGAGTA CTGGAACAGC CAGAAGGACC 180

TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGTCG 240

GGCCCATGAC CCTGCAG 257



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Asp Glu Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Val Gly
65 70 75 80

Pro Met Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGCGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACC 180 

TCCTGGAGGA GAAGCGGGCA TTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG 240 

AGGCCGTGAC CCTGCAG 257

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys
50 55 60

Arg Ala Leu Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

AGAATTACGT GTACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT ACGCGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACA 180 

TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG 240 

AGGCCGTGAC CCTGCAG 257 
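
The 257-base sequences of this listing differ from one another at only a few positions, and it is these positions that sequence-specific oligonucleotide probes have to discriminate. The following Python sketch, which is illustrative and not part of the original disclosure, lists the 1-based positions at which SEQ ID NO: 46 and SEQ ID NO: 48 differ. The function name `variable_positions` is hypothetical, and the sequences are transcribed from the entries above.

```python
# Illustrative sketch only: compare two amplicon sequences from this listing.
SEQ_ID_46 = """
AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC
TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGCGCGCTT CGACAGCGAC GTGGGGGAGT
TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACC
TCCTGGAGGA GAAGCGGGCA TTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG
AGGCCGTGAC CCTGCAG
"""

SEQ_ID_48 = """
AGAATTACGT GTACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC
TGGAGAGATA CATCTACAAC CGGGAGGAGT ACGCGCGCTT CGACAGCGAC GTGGGGGAGT
TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACA
TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG
AGGCCGTGAC CCTGCAG
"""


def clean(seq):
    """Remove the whitespace used for 10-base grouping and line breaks."""
    return "".join(seq.split()).upper()


def variable_positions(a, b):
    """Return the 1-based positions at which two equal-length sequences differ."""
    a, b = clean(a), clean(b)
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [i for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


if __name__ == "__main__":
    print(variable_positions(SEQ_ID_46, SEQ_ID_48))
```

The differences cluster near the 5' end and at a few isolated positions further downstream, which is the kind of pattern that short region-specific probes can resolve.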

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Asn Tyr Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Tyr Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85
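
Each 85-residue peptide in this listing corresponds to the preceding 257-base sequence read in the frame that begins at the third base (2 + 85 × 3 = 257). The following Python sketch, which is illustrative and not part of the original disclosure, translates SEQ ID NO: 48 with the standard genetic code and reproduces the peptide of SEQ ID NO: 49 in one-letter notation. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
# Illustrative sketch only: translate SEQ ID NO: 48 using the standard genetic code.
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

SEQ_ID_48 = """
AGAATTACGT GTACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC
TGGAGAGATA CATCTACAAC CGGGAGGAGT ACGCGCGCTT CGACAGCGAC GTGGGGGAGT
TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACA
TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG
AGGCCGTGAC CCTGCAG
"""


def translate(dna, offset=0):
    """Translate a DNA string codon by codon, starting at 0-based `offset`."""
    dna = "".join(dna.split()).upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


if __name__ == "__main__":
    # The reading frame starts at the third base, so two bases are skipped.
    print(translate(SEQ_ID_48, offset=2))
```

The same call with `offset=2` can be applied to the other 257-base entries to cross-check each nucleotide sequence against its peptide.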

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG ATGAGGTGTA CTGGAACAGC CAGAAGGACA 180

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGGCG 240 

GGCCCATGAC CCTGCAG 257 

(2) INFORMATION FOR SEQ ID NO: 51:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 85 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Asp Glu Val Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Gly Gly
65 70 75 80

Pro Met Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGCGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACA 180 

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGGCG 240 

GGCCCATGAC CCTGCAG 257 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 85 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Ala Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Gly Gly
65 70 75 80

Pro Met Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

AGAATTACCT TTTCCAGGGA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGC TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG CTGCGGAGTA CTGGAACAGC CAGAAGGACC 180

TCCTGGAGGA GAAGCGGGCA TTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGTCG 240

GGCCCATGAC CCTGCAG 257

(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 85 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

Asn Tyr Leu Phe Gln Gly Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Leu Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Ala Ala Glu Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Lys
50 55 60

Arg Ala Leu Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Val Gly
65 70 75 80

Pro Met Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

AGAATTACGT GTACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG ATGAGGACTA CTGGAACAGC CAGAAGGACC 180 

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGGTATGCAG ACACAACTAC GAGCTGGACG 240 



AGGCCGTGAC CCTGCAG 257

(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

Asn Tyr Val Tyr Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Asp Glu Asp Tyr Trp Asn Ser Gln Lys Asp Leu Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

AGAATTACGT GCACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG AGGCGGAGTA CTGGAACAGC CAGAAGGACA 180 

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG 240 

AGGCCGTGAC CCTGCAG 257 



(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

Asn Tyr Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Glu Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

AGAATTACGT GCACCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60

TGGAGAGATA CATCTACAAC CGGGAGGAGT TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG ATGAGGACTA CTGGAACAGC CAGAAGGACA 180 

TCCTGGAGGA GAAGCGGGCA GTGCCGGACA GGGTATGCAG ACACAACTAC GAGCTGGACG 240 

AGGCCGTGAC CCTGCAG 257



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



Asn Tyr Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Phe Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Asp Glu Asp Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Lys
50 55 60

Arg Ala Val Pro Asp Arg Val Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85



(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

AGAATTACGT GTCCCAGTTA CGGCAGGAAT GCTACGCGTT TAATGGGACA CAGCGCTTCC 60 

TGGAGAGATA CATCTACAAC CGGGAGGAGC TCGTGCGCTT CGACAGCGAC GTGGGGGAGT 120 

TCCGGGCGGT GACGGAGCTG GGGCGGCCTG AGGCGGAGTA CTGGAACAGC CAGAAGGACA 180 

TCCTGGAGGA GGAGCGGGCA GTGCCGGACA GGATGTGCAG ACACAACTAC GAGCTGGACG 240 

AGGCCGTGAC CCTGCAG 257 



(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

Asn Tyr Val His Gln Leu Arg Gln Glu Cys Tyr Ala Phe Asn Gly Thr
1 5 10 15

Gln Arg Phe Leu Glu Arg Tyr Ile Tyr Asn Arg Glu Glu Leu Val Arg
20 25 30

Phe Asp Ser Asp Val Gly Glu Phe Arg Ala Val Thr Glu Leu Gly Arg
35 40 45

Pro Glu Ala Glu Tyr Trp Asn Ser Gln Lys Asp Ile Leu Glu Glu Glu
50 55 60

Arg Ala Val Pro Asp Arg Met Cys Arg His Asn Tyr Glu Leu Asp Glu
65 70 75 80

Ala Val Thr Leu Gln
85



(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
TACCTTTTCC AGGGACGG 18

(2) INFORMATION FOR SEQ ID NO: 65:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
TACGTGTACC AGTTACGG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
TACGTGTACC AGGGACGG 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:
TACGTGCACC AGTTACGG 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CGGGAGGAGT TCGCGCGC 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:
CGGGAGGAGT TCGTGCGC 

(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
CGGGAGGAGC TCGTGCGC 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CGGCAGGAGT ACGCGCGC 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CGGGAGGAGT ACGCGCGC 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGGGAGGAAT TCGTGCGC 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:
CCTGCTGCGG AGTACTGG 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
CCTGATGAGG AGTACTGG 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:
CCTGAGGCGG AGTACTGG 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:
CCTGATGAGG ACTACTGG 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:
GACATCCTGG AGGAGAAG 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:
GACATCCTGG AGGAGGAG 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:
GACCTCCTGG AGGAGAAG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:
GACCTCCTGG AGGAGGAG 18 

(2) INFORMATION FOR SEQ ID NO: 82:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

GACCTCCTGG AGGAGAGG 18 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GACAGGATGT GCAGACAC 18 


(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GACAGGGTAT GCAGACAC 18 

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
CTGGGCGGGC CCATGACC 18 

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CTGGACGAGG CCGTGACC 18

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 




EP 0 575 845 A2 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CTGGTCGGGC CCATGACC 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TGTCTGCACA TCCTGTCCG 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TGTCTGCATA CCCTGTCCG 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CGGACAGGAT ATGCAGACA 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
GGGATCCGAG AGTGGCGCCT CCGCTCAT 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
CCTGATGAGG TGTACTG

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:
AGATGAGATG TTCTATG 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:
GTTTGGCCAA GCCTTTT 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
AGATGAGCAG TTCTATG 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:
GTTTGGCCGA GCCTTTT 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
CAGGGATCCG CAGAGAATTA C 

(2) INFORMATION FOR SEQ ID NO: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:
GTCCTGCAGT CACTCACCTC GGCG 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:
GAATTACCTT TTCCAGGGA 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
ATTACGTGTA CCAGTTACG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:
CGTCCCTGGT ACACGTAAT 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:
CCTGCTGCGG AGTACTG 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:
CAGTACTCCT CATCAGG

(2) INFORMATION FOR SEQ ID NO: 104:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CAGTACTCCG CCTCAGG 17 

(2) INFORMATION FOR SEQ ID NO: 105: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
CCTGATGAGG ACTACTG 17 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GACATCCTGG AGGAGAAGC 19

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GCTCCTCCTC CAGGATGTC 19 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
GACCTCCTGG AGGAGAAGC 19 

(2) INFORMATION FOR SEQ ID NO: 109: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:
GCTCCTCCTC CAGGAGGTC 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:
ATTACGTGCA CCAGTTACG 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:
CGTAACTGGT ACACGTAAT 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
CTGCAGGGTC ATGGGCCCCC G 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
CTGCAGGGTC ACGGCCTCGT C 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
ATTACGTGTA CCAGTTA 

(2) INFORMATION FOR SEQ ID NO: 115: 
(i) SEQUENCE CHARACTERISTICS: 






(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

CCTGAGGCGG AGTACTG 17

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GACCTCCTGG AGGAGGAG 18

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
GACCTCCTGG AGGAGAGG 18

(2) INFORMATION FOR SEQ ID NO: 118:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CTGGTCGGGC CCATGACC 18

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
GAATTACCTT TTCCAGGGAC 20 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
TTACGTGTAC CTGGGAC 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:
ACATCCTGGA GGAGAAGC 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:
ACATCCTGGA GGAGGAGC 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:
ACCTCCTGGA GGAGAAGC 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:
CCTGATGAGG AGTACTG 
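
Several oligonucleotides in this listing are the reverse complements of others, so that a given polymorphic region can be probed on either strand. For example, SEQ ID NO: 103 is the reverse complement of SEQ ID NO: 124, and SEQ ID NO: 109 is the reverse complement of SEQ ID NO: 127. The following Python sketch, which is illustrative and not part of the original disclosure, checks such relationships. The helper names `reverse_complement` and `probe_strand` are hypothetical.

```python
# Illustrative sketch only: relate sense and antisense oligonucleotides.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def clean(seq):
    """Remove the whitespace used for 10-base grouping."""
    return "".join(seq.split()).upper()


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return clean(seq).translate(_COMPLEMENT)[::-1]


def probe_strand(probe, target):
    """Return '+' if the probe sequence occurs in the target as written,
    '-' if its reverse complement occurs in the target, otherwise None."""
    probe, target = clean(probe), clean(target)
    if probe in target:
        return "+"
    if reverse_complement(probe) in target:
        return "-"
    return None


SEQ_ID_103 = "CAGTACTCCT CATCAGG"
SEQ_ID_124 = "CCTGATGAGG AGTACTG"
SEQ_ID_109 = "GCTCCTCCTC CAGGAGGTC"
SEQ_ID_127 = "GACCTCCTGG AGGAGGAGC"

if __name__ == "__main__":
    print(reverse_complement(SEQ_ID_124) == clean(SEQ_ID_103))
    print(reverse_complement(SEQ_ID_127) == clean(SEQ_ID_109))
```

Passing one of the 257-base sequences as `target` indicates in the same way whether a given oligonucleotide matches the listed strand or its complement.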

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:
CTGGGCGGGC CCATG 

(2) INFORMATION FOR SEQ ID NO: 126: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CTGGACGAGG CCGTG 

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
GACCTCCTGG AGGAGGAGC 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
GACCTCCTGG AGGAGAGGC 

(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
AGCTGGGCGG GCCCATGAC 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:
AGCTGGACGA GGCCGTGAC 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:
CCAGTACTCC TCATCAGGC

(2) INFORMATION FOR SEQ ID NO: 132:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:
ATTACGTGCA CCAGTTAC 

(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:
ATTACGTGCA CCAGTTA 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:
CAGTACTCCT CATCAG 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:
CTGATGAGGA CTACTG 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:
CGCTTCGACA GCGACGT 

(2) INFORMATION FOR SEQ ID NO: 137: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
CCGTCCCTGG AAAAGGTAAT TC 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 
GACCTCCTGN GAGGAGAGGC 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 
GACCTCCTGG AGNGAGGAGC 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 
CGCGGATCCT GTGTCAACTT ATGCCGC 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
CTGGCTGCAG TGTGGTTGGA ACGC 

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 
ACACAGGAAA CAGCTATGAC CATG 






(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:
CCAGGGTTTT CCCAGTCACG AC 

(2) INFORMATION FOR SEQ ID NO: 144: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:
GCTGCAGGAG AGTGGCGCCT CCGCTCAT 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:
CGGATCCGGC CCAAAGCCCT CACTC 

(2) INFORMATION FOR SEQ ID NO: 146:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:
GCGGCATAAG TTGACACATG GTCCGCT 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:
GCGTTCCAAC CACACTCAGG CCAC 

(2) INFORMATION FOR SEQ ID NO: 148: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 
GTAATTCTCT GCGGGGAGGG G 21 

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
CGCCGAGGTG AGTGAGGGCT TTGG 24 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 
GACAGGATAT GCAGACAC 18

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 
AGGAGTTCGC GCGCTT 


(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
AGGAGTTCGT GCGCTT 

(2) INFORMATION FOR SEQ ID NO: 153: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:
AGGAGCTCGT GCGCTTC 17

(2) INFORMATION FOR SEQ ID NO: 154: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 
CCGGCAGGAG TACGCGC 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:
GAGGAGTACG CGCGCT 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:
CGAGCTGGGC GGGCCCA 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:
CGAGCTGGTC GGGCCCA 
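
Each entry in the listing above declares a LENGTH and then gives the sequence in space-separated groups of ten bases. A minimal sketch of a consistency check between the two follows; the helper name `sequence_length` and the `records` dictionary are hypothetical, and only three entries (SEQ ID NO: 134, 136 and 140) are copied in for illustration.

```python
# Hypothetical consistency check: declared LENGTH versus counted bases.
# The three records are copied from SEQ ID NO: 134, 136 and 140 in the listing.
records = {
    134: (16, "CAGTACTCCT CATCAG"),
    136: (17, "CGCTTCGACA GCGACGT"),
    140: (27, "CGCGGATCCT GTGTCAACTT ATGCCGC"),
}


def sequence_length(listing_text):
    """Count bases, ignoring the spaces that group the listing into tens."""
    return len(listing_text.replace(" ", ""))


# Collect any entry whose declared length disagrees with the counted length.
mismatches = {
    seq_id: (declared, sequence_length(text))
    for seq_id, (declared, text) in records.items()
    if declared != sequence_length(text)
}
print(mismatches)  # an empty dict means every declared length agrees
```

The same check can be extended to the remaining entries by adding them to `records`.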



Claims

1. A process for determining an individual's HLA DP genotype from a nucleic acid-containing sample
originating from the individual whose HLA-DP genotype is to be determined, which method comprises:

(a) amplifying a target region of the nucleic acids in the sample under conditions suitable for 
carrying out a polymerase chain reaction, whereby the target region contains a polymorphic region 
(variable segment) of an HLA DP gene using a primer selected from the group consisting of primers 

AB111 SEQ ID NO: 91 5'GGGATCCGAGAGTGGCGCCTCCGCTCAT,
RS348 SEQ ID NO: 142 5'ACACAGGAAACAGCTATGACCATG, and
RS349 SEQ ID NO: 143 5'CCAGGGTTTTCCCAGTCACGAC;

(b) mixing the amplified nucleic acids with a panel of sequence specific oligonucleotide (SSO)
probes, wherein each probe is complementary to a variant sequence of a variable segment of an
HLA DP gene, under conditions wherein SSO probes bind to said amplified nucleic acids to form
stable hybrid duplexes only if they are exactly complementary; and

(c) detecting hybrids formed between the amplified nucleic acids and the SSO probes.
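
The exact-complementarity condition in step (b) can be illustrated in silico. The following is a minimal sketch, not the laboratory procedure: it treats a probe as hybridizing when its full sequence occurs without mismatch in either strand of the amplified target. The function names and the `amplicon` string are hypothetical and chosen only for illustration; the two probe sequences are SEQ ID NO: 134 and SEQ ID NO: 135 from the listing. The sketch assumes sequences contain only A, C, G and T, so probes containing N (SEQ ID NO: 138 and 139) are not handled.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


def probes_hybridizing(amplicon, probes):
    """Return names of probes found without mismatch in either strand."""
    strands = (amplicon, reverse_complement(amplicon))
    return [
        name
        for name, probe in probes.items()
        if any(probe in strand for strand in strands)
    ]


# Probe sequences taken from the sequence listing.
probes = {
    "SEQ ID NO: 134": "CAGTACTCCTCATCAG",
    "SEQ ID NO: 135": "CTGATGAGGACTACTG",
}

# Hypothetical toy target: SEQ ID NO: 135 flanked by a few arbitrary bases.
amplicon = "GGA" + "CTGATGAGGACTACTG" + "CC"

print(probes_hybridizing(amplicon, probes))  # → ['SEQ ID NO: 135']
```

In this toy case SEQ ID NO: 134 differs from the opposite strand at a single position and is therefore not reported, which mirrors the requirement that only exactly complementary probes form stable duplexes.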

2. The process of Claim 1, wherein the probes of the panel are selected from the group consisting of:
SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG);
SEQ ID NO: 99 (5'GAATTACCTTTTCCAGGGA); 
SEQ ID NO: 100 (5'ATTACGTGTACCAGTTACG);
SEQ ID NO: 101 (5'CGTCCCTGGTACACGTAAT); 
SEQ ID NO: 102 (5'CCTGCTGCGGAGTACTG); 
SEQ ID NO: 103 (5'CAGTACTCCTCATCAGG); 
SEQ ID NO: 104 (5'CAGTACTCCGCCTCAGG); 
SEQ ID NO: 105 (5'CCTGATGAGGACTACTG);
SEQ ID NO: 106 (5'GACATCCTGGAGGAGAAGC); 
SEQ ID NO: 107 (5'GCTCCTCCTCCAGGATGTC); 
SEQ ID NO: 108 (5'GACCTCCTGGAGGAGAAGC); 
SEQ ID NO: 109 (5'GCTCCTCCTCCAGGAGGTC);
SEQ ID NO: 110 (5'ATTACGTGCACCAGTTACG); 
SEQ ID NO: 111 (5'CGTAACTGGTACACGTAAT); 






SEQ ID NO: 112 (5'CTGCAGGGTCATGGGCCCCCG); 
SEQ ID NO: 113 (5'CTGCAGGGTCACGGCCTCGTC);
SEQ ID NO: 114 (5'ATTACGTGTACCAGTTA); 
SEQ ID NO: 115 (5'CCTGAGGCGGAGTACTG);
SEQ ID NO: 116 (5'GACCTCCTGGAGGAGGAG); 
SEQ ID NO: 117 (5'GACCTCCTGGAGGAGAGG); 
SEQ ID NO: 118 (5'CTGGTCGGGCCCATGACC); 
SEQ ID NO: 119 (5'GAATTACCTTTTCCAGGGAC); 
SEQ ID NO: 120 (5'TTACGTGTACCTGGGAC);
SEQ ID NO: 121 (5'ACATCCTGGAGGAGAAGC); 
SEQ ID NO: 122 (5'ACATCCTGGAGGAGGAGC); 
SEQ ID NO: 123 (5'ACCTCCTGGAGGAGAAGC); 
SEQ ID NO: 124 (5'CCTGATGAGGAGTACTG); 
SEQ ID NO: 125 (5'CTGGGCGGGCCCATG);
SEQ ID NO: 126 (5'CTGGACGAGGCCGTG); 
SEQ ID NO: 127 (5'GACCTCCTGGAGGAGGAGC);
SEQ ID NO: 128 (5'GACCTCCTGGAGGAGAGGC); 
SEQ ID NO: 129 (5'AGCTGGGCGGGCCCATGAC); 
SEQ ID NO: 130 (5'AGCTGGACGAGGCCGTGAC); 
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC); 
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA); 
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG);
SEQ ID NO: 135 (5'CTGATGAGGACTACTG); 
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC); 
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC); 
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC); 
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC);
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA). 



3. The process of Claim 1, wherein the probes of the panel are selected from the group consisting of:
SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG);
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC);
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA);
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG);
SEQ ID NO: 135 (5'CTGATGAGGACTACTG);
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC);
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC);
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC);
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA). 



4. The process of Claim 1, wherein the individual's HLA DP genotype comprises an allele selected from 
the group consisting of DPB21, DPB22, DPB23, DPB24, DPB25, DPB26, DPB27, DPB28, DPB29, and
DPB30. 

5. A process for determining an individual's susceptibility to an autoimmune disease comprising determining
the individual's HLA DP genotype according to the process of Claim 1 and determining whether the
individual's genotype is a genotype which is linked to an autoimmune disease.

6. The process of Claim 5, wherein the autoimmune disease is pauciarticular juvenile rheumatoid arthritis,
and wherein the genotype which is linked to an autoimmune disease comprises the DPB2.1 allele. 

7. The process of Claim 6, wherein the autoimmune disease is IDDM. 

8. The process of Claim 6, wherein the autoimmune disease is CD, and wherein the genotype which is
linked to an autoimmune disease comprises an allele selected from the group consisting of DPB13,
DPB1, DPB3, and DPB4.2.

9. A process of providing forensic evidence concerning the derivation of a sample which contains genomic 
nucleic acids comprising determining, according to the process in Claim 1, the HLA DP genotypes of
the sample and of a suspected individual, comparing the HLA DP genotypes of the individual and of the 
sample, and deducing whether the sample could have been derived from the individual. 
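
The comparison step of this claim can be sketched as a check on unordered allele pairs. This is an illustrative sketch only: the function name `could_derive_from` and the three genotypes are hypothetical, and the allele labels are borrowed from Claims 6 and 8 purely as example values. A match means only that derivation from the individual is not excluded.

```python
def could_derive_from(sample_genotype, individual_genotype):
    """True when both genotypes carry the same unordered pair of alleles."""
    return sorted(sample_genotype) == sorted(individual_genotype)


# Hypothetical genotypes, each given as a pair of allele labels.
sample = ("DPB2.1", "DPB4.2")
suspect_a = ("DPB4.2", "DPB2.1")   # same alleles, different order
suspect_b = ("DPB3", "DPB4.2")     # one allele differs

print(could_derive_from(sample, suspect_a))  # → True
print(could_derive_from(sample, suspect_b))  # → False
```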

10. The process of Claim 1, wherein the panel of probes comprises probes with hybridizing regions:
SEQ ID NO: 88 (5'TGTCTGCACATCCTGTCCG);
SEQ ID NO: 89 (5'TGTCTGCATACCCTGTCCG);
SEQ ID NO: 90 (5'CGGACAGGATATGCAGACA); 
SEQ ID NO: 99 (5'GAATTACCTTTTCCAGGGA); 
SEQ ID NO: 101 (5'CGTCCCTGGTACACGTAAT); 
SEQ ID NO: 102 (5'CCTGCTGCGGAGTACTG); 
SEQ ID NO: 105 (5'CCTGATGAGGACTACTG); 
SEQ ID NO: 106 (5'GACATCCTGGAGGAGAAGC); 
SEQ ID NO: 107 (5'GCTCCTCCTCCAGGATGTC); 
SEQ ID NO: 108 (5'GACCTCCTGGAGGAGAAGC); 
SEQ ID NO: 110 (5'ATTACGTGCACCAGTTACG); 
SEQ ID NO: 112 (5'CTGCAGGGTCATGGGCCCCCG); 
SEQ ID NO: 114 (5'ATTACGTGTACCAGTTA); 
SEQ ID NO: 115 (5'CCTGAGGCGGAGTACTG); 
SEQ ID NO: 116 (5'GACCTCCTGGAGGAGGAG); 
SEQ ID NO: 117 (5'GACCTCCTGGAGGAGAGG); 
SEQ ID NO: 126 (5'CTGGACGAGGCCGTG); and 
SEQ ID NO: 131 (5'CCAGTACTCCTCATCAGGC). 



11. The process of Claim 10, wherein the panel of probes further comprises a probe with hybridizing region
SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG). 

12. The process of Claim 10, wherein the primers used are 

UG19 SEQ ID NO: 144 (GCTGCAGGAGAGTGGCGCCTCCGCTCAT) and 
UG21 SEQ ID NO: 145 (CGGATCCGGCCCAAAGCCCTCACTC). 



13. An oligonucleotide probe selected from the group consisting of:

SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG);
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC);
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA);
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG);
SEQ ID NO: 135 (5'CTGATGAGGACTACTG);
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC);
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC);
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC);
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA). 

14. A primer selected from the group consisting of primers 

AB111 SEQ ID NO: 91 5'GGGATCCGAGAGTGGCGCCTCCGCTCAT,
RS348 SEQ ID NO: 142 5'ACACAGGAAACAGCTATGACCATG, and
RS349 SEQ ID NO: 143 5'CCAGGGTTTTCCCAGTCACGAC.

15. An oligonucleotide probe as claimed in claim 13 as a diagnostic tool. 

16. A primer as claimed in claim 14 as a diagnostic tool. 

17. A panel of SSO probes, wherein the probes of the panel are selected from the group consisting of: 

SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG); 
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC); 
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA);
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG); 
SEQ ID NO: 135 (5'CTGATGAGGACTACTG); 
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC); 
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC); 
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC); 
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT);
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA). 

18. The use of an oligonucleotide probe as claimed in claim 13, a primer as claimed in claim 14, or a panel
of SSO probes as claimed in claim 17 for determining an individual's HLA DP genotype. 

19. The use of an oligonucleotide probe as claimed in claim 13, a primer as claimed in claim 14, or a panel
of SSO probes as claimed in claim 17 for determining an individual's HLA DP genotype, wherein the
individual's HLA DP genotype comprises an allele selected from the group consisting of DPB21,
DPB22, DPB23, DPB24, DPB25, DPB26, DPB27, DPB28, DPB29, and DPB30.

20. A kit useful for determining an individual's HLA DP genotype comprising: 

(a) a panel of SSO probes, wherein the probes of the panel are selected from the group consisting of:

SEQ ID NO: 92 (5'CCTGATGAGGTGTACTG); 
SEQ ID NO: 132 (5'ATTACGTGCACCAGTTAC); 
SEQ ID NO: 133 (5'ATTACGTGCACCAGTTA); 
SEQ ID NO: 134 (5'CAGTACTCCTCATCAG);
SEQ ID NO: 135 (5'CTGATGAGGACTACTG); 
SEQ ID NO: 137 (5'CCGTCCCTGGAAAAGGTAATTC); 
SEQ ID NO: 138 (5'GACCTCCTGNGAGGAGAGGC);
SEQ ID NO: 139 (5'GACCTCCTGGAGNGAGGAGC); 
SEQ ID NO: 151 (X-AGGAGTTCGCGCGCTT); 
SEQ ID NO: 152 (X-AGGAGTTCGTGCGCTT); 
SEQ ID NO: 153 (X-AGGAGCTCGTGCGCTTC); 
SEQ ID NO: 154 (X-CCGGCAGGAGTACGCGC); 
SEQ ID NO: 155 (X-GAGGAGTACGCGCGCT); 
SEQ ID NO: 156 (X-CGAGCTGGGCGGGCCCA); and 
SEQ ID NO: 157 (X-CGAGCTGGTCGGGCCCA); 

as well as 

(b) instructions for determining the genotype by utilizing kit ingredients. 

21. The kit of Claim 20, wherein the individual's HLA DP genotype comprises an allele selected from the
group consisting of DPB21, DPB22, DPB23, DPB24, DPB25, DPB26, DPB27, DPB28, DPB29, and
DPB30.

22. The kit of Claim 20 further comprising a container containing oligonucleotide primers useful for
amplification of a target region of an HLA DP gene, wherein said target region contains said variable 
segment.